Synthetic urogastrone gene, corresponding plasmid recombinants, transformed cells, production thereof and
urogastrone expression.

A synthetic gene characterised in that it codes for the
expression of urogastrone or a sub-unit thereof is disclosed. 

The production thereof by the assembly and ligation of a 
number of nucleotide blocks is also disclosed, as are 
corresponding plasmid recombinants, transformed cells and 
the production thereof. 

The expression of urogastrone is further disclosed.
Urogastrone is a polypeptide hormone (protein) which
may be isolated in small amounts from human urine. It has
an application in the treatment of ulcers and in the
promotion of wound healing. The present invention provides a
more viable commercial production.
"Synthetic urogastrone gene, corresponding plasmid
recombinants, transformed cells, production thereof
and urogastrone expression"



This invention relates to a synthetic urogastrone 
gene, to corresponding plasmid recombinants and transformed 
cells, to the production thereof and to urogastrone expression.

Urogastrone is a polypeptide hormone (protein)
synthesised in the duodenum and in the salivary glands of
normal humans (see, for example, Heitz, et al, (1978),
Gut, 19, 408-413). Urogastrone suppresses the secretion
of gastric acid and promotes cell growth (see, for example,
Elder, et al, (1975), Gut, 16, 887-893). Therefore, it has
an application in the treatment of ulcers and in the
promotion of wound healing. Urogastrone is excreted in
small amounts in human urine and may be isolated therefrom.
There exists, however, a need for a more viable commercial
production thereof and such is provided according to the
present invention.
Urogastrone is known to consist of 53 amino
acids in the following sequence:
Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp -
Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr -
Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys -
Val Val Gly Tyr Ile Gly Glu Arg Cys Gln Tyr -
Arg Asp Leu Lys Trp Trp Glu Leu Arg

(see, for example, Gregory, H., and Preston, B.M., (1977),
Int. J. Peptide Protein Res., 9, 107-118.)

From the above amino acid sequence, a
corresponding synthetic gene sequence has been invented,
subject to a number of specific non-obvious criteria, and
oligonucleotide blocks synthesised which, when assembled,
form a synthetic gene coding for urogastrone. The blocks
have been hybridised and ligated in pre-determined stages
to construct the urogastrone gene in two portions. These
have been cloned in two operations into a new specifically
designed chimeric E. coli/S. aureus vector so as to
produce a full length urogastrone gene flanked only by
E. coli plasmid DNA. The gene has been excised from
this recombinant and re-cloned into vectors specifically
designed to maximise expression of the gene in E. coli,
under the control of the promoter obtained from the
E. coli tryptophan operon. A protein resembling human
urogastrone has thus been expressed in E. coli.

From the above amino acid sequence, because
of the degeneracy of the genetic code, it is possible
to predict numerous nucleotide sequences which would
code for the protein.
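
The scale of this degeneracy can be estimated with a short calculation. The following is a minimal sketch only, assuming the standard genetic code and the 53-residue sequence given above transcribed into one-letter code; the names `CODON_TABLE`, `UROGASTRONE` and `count_coding_sequences` are illustrative and do not appear in the original text.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G
# (assumption: the standard code applies; '*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Urogastrone in one-letter code, transcribed from the 53-residue sequence above.
UROGASTRONE = (
    "NSDSECPLSHD" "GYCLHDGVCMY" "IEALDKYACNC" "VVGYIGERCQY" "RDLKWWELR"
)


def codons_per_residue(table=CODON_TABLE):
    """Return how many codons specify each amino acid (and stop)."""
    counts = {}
    for aa in table.values():
        counts[aa] = counts.get(aa, 0) + 1
    return counts


def count_coding_sequences(protein):
    """Number of distinct nucleotide sequences that encode `protein`."""
    counts = codons_per_residue()
    total = 1
    for residue in protein:
        total *= counts[residue]
    return total


if __name__ == "__main__":
    print(len(UROGASTRONE), "residues")
    print(count_coding_sequences(UROGASTRONE), "possible coding sequences")
```

Under these assumptions the count is on the order of 10^23, which is why explicit selection criteria are needed to arrive at a single sequence.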

In the inventive determination of an optimum
sequence from the large number of possibilities, several
non-obvious criteria have been observed. Firstly,
trinucleotide codons should be used which are acceptable
or preferable in the cells to be used, in particular
E. coli. Secondly, it was decided that it was desirable
to have different restriction enzyme recognition sites
at the termini of the molecule so as to allow insertion
into a plasmid in a desired orientation. Moreover, it
was decided to select sites which allowed the use of
well-understood cloning vectors, such as pBR322 (see,
for example, Bolivar, F., et al, (1977), Gene, 2, 95-113).
In fact, Hind III and Bam HI sites were selected and
introduced at the 5' and 3' ends, respectively. Thirdly,
it was thought desirable to introduce a series of
restriction endonuclease recognition sites strategically
placed along the molecule to enable the gene to be
specifically dissected to aid characterisation and,
possibly, mutagenesis. Also, this measure allowed the
two portions of the molecule to be cloned in stages. In
particular, an Xba I site was introduced at a central
location in the gene. Fourthly, the synthesis should
not be unnecessarily complicated and illegitimate cross-
hybridisations should be minimised in order to facilitate
gene assembly. Using a computer, energies of interactions
arising from all possible approximations were calculated
(see, for example, Tinoco, Jr., I., et al, (1971), Nature,
230, 362-367; Powers, G.J., et al, (1975), JACS, 97,
875-889), so that stable off-diagonal interactions might
be avoided where possible. Fifthly, since the protein
ultimately expressed in bacterial cells will be in the
form of a fusion product, it was desirable to have a means
of cleaving the urogastrone portion from such a fusion
product. Since urogastrone is known to be insensitive to
trypsin (see, for example, Gregory and Preston, loc cit),
the codons specifying the dipeptide Lys-Lys were introduced
near the end of the gene corresponding to the urogastrone
N-terminus in order to serve as a substrate for tryptic
proteolysis.
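
The fourth criterion, avoidance of illegitimate cross-hybridisation between blocks, can be illustrated with a much cruder screen than the energy calculation cited above. The sketch below does not reproduce that calculation: it merely reports the longest stretch over which two oligonucleotides could pair perfectly in antiparallel orientation, as a simple stand-in. The function names and the example oligonucleotides are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_complementary_run(oligo_a, oligo_b):
    """Length of the longest stretch of oligo_a that can pair perfectly,
    in antiparallel orientation, with some stretch of oligo_b.

    A stretch of oligo_a pairs with oligo_b exactly when it also occurs in
    the reverse complement of oligo_b, so this is a longest-common-substring
    search between oligo_a and reverse_complement(oligo_b).
    """
    target = reverse_complement(oligo_b)
    best = 0
    previous = [0] * (len(target) + 1)
    for base_a in oligo_a:
        current = [0] * (len(target) + 1)
        for j, base_b in enumerate(target, start=1):
            if base_a == base_b:
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


if __name__ == "__main__":
    # Two made-up oligonucleotides sharing a palindromic GAATTC core.
    print(longest_complementary_run("GGGAATTCCC", "TTGAATTCAA"))
```

A pair of blocks scoring highly in such a screen, without being intended neighbours in the assembly, would be a candidate for redesign; a proper treatment would weight the pairing by stability, as the cited references do.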

The present invention relates to a synthetic
gene characterised in that it codes for the expression
of urogastrone or a sub-unit thereof.

The particular preferred sequence selected for
the coding portion of the synthetic urogastrone gene is
as follows:

5' AATTCCGATAGCGAGTGTCCTCTG-
3' TTAAGGCTATCGCTCACAGGAGAC-
   AGTCACGATGGTTACTGTCTACATGACGG-
   TCAGTGCTACCAATGACAGATGTACTGCC-
   CGTCTGTATGTATATTGAGGCTCTAGACA-
   GCAGACATACATATAACTCCGAGATCTGT-
   AGTACGCGTGTAATTGCGTTGTTGGCTAC-
   TCATGCGCACATTAACGCAACAACCGATG-
   ATCGGTGAGCGCTGTCAGTATCGAGATCT-
   TAGCCACTCGCGACAGTCATAGCTCTAGA-
   GAAATGGTGGGAACTTAGA
   CTTTACCACCCTTGAATCT

One aspect of the present invention is directed
accordingly. A sub-unit of such a sequence also constitutes
an aspect of the present invention.
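
As a consistency check, the coding portion given above can be translated and compared with the 53-residue amino acid sequence, and the second strand can be compared with the complement of the first. The following is a sketch under stated assumptions: the standard genetic code is used, the two strands are transcribed from the listing above (upper strand 5' to 3', lower strand 3' to 5'), and the names `CODING`, `NON_CODING`, `translate` and `complement` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Upper (coding) strand of the preferred sequence, 5'->3'.
CODING = (
    "AATTCCGATAGCGAGTGTCCTCTG"
    "AGTCACGATGGTTACTGTCTACATGACGG"
    "CGTCTGTATGTATATTGAGGCTCTAGACA"
    "AGTACGCGTGTAATTGCGTTGTTGGCTAC"
    "ATCGGTGAGCGCTGTCAGTATCGAGATCT"
    "GAAATGGTGGGAACTTAGA"
)

# Lower strand, written 3'->5' exactly as it is laid out under the coding strand.
NON_CODING = (
    "TTAAGGCTATCGCTCACAGGAGAC"
    "TCAGTGCTACCAATGACAGATGTACTGCC"
    "GCAGACATACATATAACTCCGAGATCTGT"
    "TCATGCGCACATTAACGCAACAACCGATG"
    "TAGCCACTCGCGACAGTCATAGCTCTAGA"
    "CTTTACCACCCTTGAATCT"
)

# Urogastrone in one-letter code (53 residues).
UROGASTRONE = "NSDSECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELR"


def translate(dna):
    """Translate a coding strand read 5'->3' from its first base."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def complement(dna):
    """Base-by-base complement, without reversing the strand."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))


if __name__ == "__main__":
    print("length:", len(CODING))
    print("translation matches:", translate(CODING) == UROGASTRONE)
    print("strands complementary:", complement(CODING) == NON_CODING)
```

Both comparisons are expected to hold for the sequence as listed; a mismatch would point to a transcription error in one of the strands.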



In fact, bearing in mind, inter alia, the
above-mentioned considerations, for convenience, a
slightly longer sequence was selected which is as
follows:

Coding:     AGCTTAAAAAGAATTCCGATAGCGAGT-
Non-coding:     ATTTTTCTTAAGGCTATCGCTCA-
            GTCCTCTGAGTCACGATGGTTACTGTCT-
            CAGGAGACTCAGTGCTACCAATGACAGA-
            ACATGACGGCGTCTGTATGTATATTGAG-
            TGTACTGCCGCAGACATACATATAACTC-
            GCTCTAGACAAGTACGCGTGTAATTGCG-
            CGAGATCTGTTCATGCGCACATTAACGC-
            TTGTTGGCTACATCGGTGAGCGCTGTCA-
            AACAACCGATGTAGCCACTCGCGACAGT-
            GTATCGAGATCTGAAATGGTGGGAACTT-
            CATAGCTCTAGACTTTACCACCCTTGAA-
            AGATAAG
            TCTATTCCTAG

The whole of this expanded sequence is shown in
Figure 1 of the accompanying drawings and the restriction
enzyme recognition sites are shown in Figure 2 of the
accompanying drawings.

Another aspect of the present invention is 
directed accordingly. A sub-unit of such a sequence also 
constitutes an aspect of the present invention. 
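
The design features of the expanded sequence can be checked mechanically. The sketch below is illustrative only: it assumes the two strands as listed above (coding strand 5' to 3', non-coding strand 3' to 5'), uses the usual recognition sequences for EcoRI (GAATTC), Xba I (TCTAGA) and Bgl II (AGATCT), and introduces the hypothetical helpers `find_sites` and `cohesive_ends`.

```python
# Coding strand of the expanded sequence, 5'->3'.
CODING = (
    "AGCTTAAAAAGAATTCCGATAGCGAGT"
    "GTCCTCTGAGTCACGATGGTTACTGTCT"
    "ACATGACGGCGTCTGTATGTATATTGAG"
    "GCTCTAGACAAGTACGCGTGTAATTGCG"
    "TTGTTGGCTACATCGGTGAGCGCTGTCA"
    "GTATCGAGATCTGAAATGGTGGGAACTT"
    "AGATAAG"
)

# Non-coding strand, written 3'->5' as laid out under the coding strand.
NON_CODING = (
    "ATTTTTCTTAAGGCTATCGCTCA"
    "CAGGAGACTCAGTGCTACCAATGACAGA"
    "TGTACTGCCGCAGACATACATATAACTC"
    "CGAGATCTGTTCATGCGCACATTAACGC"
    "AACAACCGATGTAGCCACTCGCGACAGT"
    "CATAGCTCTAGACTTTACCACCCTTGAA"
    "TCTATTCCTAG"
)

# Palindromic recognition sequences, so searching one strand is sufficient.
SITES = {"EcoRI": "GAATTC", "XbaI": "TCTAGA", "BglII": "AGATCT"}


def complement(dna):
    """Base-by-base complement, without reversing the strand."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))


def find_sites(seq, sites=SITES):
    """Map each enzyme name to the 0-based start positions of its site."""
    hits = {}
    for name, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)
        hits[name] = positions
    return hits


def cohesive_ends(coding, non_coding, overhang=4):
    """Return the two single-stranded ends, each written 5'->3'.

    Assumes the coding strand overhangs at its 5' end and the non-coding
    strand overhangs at the opposite end, as in the listing above.
    """
    paired = coding[overhang:]
    if complement(paired) != non_coding[:len(paired)]:
        raise ValueError("strands are not complementary in the paired region")
    return coding[:overhang], non_coding[len(paired):][::-1]


if __name__ == "__main__":
    print(find_sites(CODING))
    print(cohesive_ends(CODING, NON_CODING))
```

The two overhangs returned correspond to the ends left by Hind III (5'-AGCT) and Bam HI (5'-GATC) cleavage, which is what allows insertion in a single orientation, and the Xba I site falls close to the middle of the molecule.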

In the above sequences, it is to be appreciated 
that minor variations are possible without departing from 
the present inventive concept.

The present invention also relates to a process 
for the production of such a synthetic gene or a sub- 
unit thereof characterised in that it comprises the 
assembly and ligation of a number of oligonucleotide 
blocks. 

It was in fact decided to synthesise a molecule
having the above expanded sequence by making 23 synthetic
oligonucleotide blocks as illustrated in Figure 3 of the
accompanying drawings, which will assemble by single-
strand overlaps to give the complete double-stranded
nucleotide sequence.

In order to minimise strong illegitimate 
interactions near the centre of the molecule during 
assembly, certain blocks of modified sequence have also 
been synthesised, as illustrated in Figure 4 of the 
accompanying drawings. 

As mentioned above, the synthetic blocks
selected are shown in Figure 3 of the accompanying
drawings. The blocks may be constructed using known
synthesis techniques (see, for example, Agarwal, et al,
(1970), Nature, 227, 27-34; and Crea, et al, (1978), Proc.
Natl. Acad. Sci. USA, 75, 5765-5769).

The synthetic methods will now be illustrated
with reference to the synthesis of the tetradecanucleotide
ApGpTpTpCpCpCpApCpCpApTpTpT.

The methods of building up oligonucleotides
from smaller units by successive coupling reactions are well
known (see, for example, Hsiung, et al, (1979), Nucleic
Acid Research, 1371-1385). The completely protected
tetradecanucleotide was built up as shown in Figure
5 of the accompanying drawings (wherein, for convenience,
protecting groups are not shown).

The condensation reactions indicated by arrows
in Figure 5 were carried out by the following procedure,
exemplified in the synthesis of DMTr-A(Bz)p(ØCl)-G(iBu)p(ØCl)-CNEt
as shown in Figure 6 of the accompanying drawings. From
1.1 to 1.5 mmole of the 3'-phosphodiester component (I)
was condensed with 1.0 mmole of the 5'-hydroxyl component
(II) in anhydrous pyridine in the presence of from 3 to
4.5 mmoles of 2,4,6-triisopropyl-benzene-sulphonyl-
tetrazolide. The reaction was left for 1 hour at room
temperature or until chromatography on silica TLC plates
eluted with 10% (v/v) methanol in chloroform showed that
the hydroxyl component was exhausted. The reaction was
quenched with 5% (w/v) sodium bicarbonate solution and
extracted with chloroform. The chloroform extract was
dried and loaded onto a reverse phase chromatography
column (ODS bonded to 15-25 micron silica). The fully
protected dinucleotide product (III) was eluted with a
solvent gradient from chloroform:methanol:water (2:6:3 v/v)
to chloroform:methanol:water (2:6:0.5 v/v). The product
(III) was extracted into chloroform and dried. The
final isolated yield was 81%.

To proceed to further condensations, the
terminal protecting group (DMTr or CNEt) was removed
selectively using triethylamine in pyridine (CNEt) or a
2% (w/v) solution of benzene sulphonic acid in chloroform:
methanol (DMTr) as shown in Figure 7 of the accompanying
drawings.

At the completion of the synthesis, all of the
protecting groups were removed by sequential treatment
with 0.1 M tetraethylammonium fluoride in THF/pyridine/
water (8:1:1 v/v), ammonia and 80% acetic acid. The
deprotected oligonucleotides were purified by ion-exchange
HPLC and sequence analysis was carried out by the method
of Wu, et al, (1976), Anal. Biochem., 74, 73-93.

The oligomeric blocks of nucleotides were
hybridised and ligated (see, for example, Agarwal, et al,
loc cit) in a series of steps, in order to minimise the
possibilities for undesirable interactions, leading to
the formation of the two portions as shown in Figure 8
of the accompanying drawings. The order of the additions
in the assembly scheme was optimised for minimal incorrect
ligations and in the case of especially difficult oligo-
meric blocks, notably 7 and 8, sub-molar quantities were
used in order to remove all monomeric units before further
additions were made.

In more detail:

Left-hand portion: Blocks 1 and 2 were ligated
to form a dimer about the Hind III site. Blocks 4, 5 and
6, and 8, 9 and 10 were also ligated in the first round
of the assembly scheme. Molar equivalents were used for
all but block 8 where 0.75 molar equivalents were
employed.

Block 3 was ligated with 1 + 2, block 7
(0.75 molar equivalent) with 4 to 6 and block 11 with
8 to 10. The 8 + 9 + 10 + 11 assembly has one flush end,
hence some blunt-end dimerisation was observed.

1 to 3 and 4 to 7 were ligated and finally 8
to 11 were ligated to the resulting 1 to 7 species.

The dimeric 1 to 11 left-hand portion was then
cleaved by Hind III (EC 3.1.23.21) and Xba I (EC 3.1.23.4)
to generate the monomeric left-hand portion, with the
correct cohesive termini to allow construction of
recombinant plasmids.

Right-hand portion: Blocks 12 and 13 were
ligated to form a dimer about the Xba I site and blocks
20, 23 and 22 similarly ligated to form a dimer about the
Bam HI site. Blocks 14, 15 and 17, and 16, 18 and 19 were
also ligated at this stage.

The 12, 13 dimer and the 14, 15, 17 assembly 
were ligated, as were the 16, 18, 19 assembly with the 20, 
22, 23 dimer, where block 21 was used as a joining section. 

These two species were then ligated to give an 
oligomeric molecule which was cleaved by Xba I and Bam HI 
(EC 3.1.23.6) to give the monomer 12 to 23 species having 
the correct cohesive termini to allow construction of 
recombinant plasmids. 
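
The division of the gene into left-hand and right-hand portions at the Xba I site can be sketched on the coding strand alone. This is an illustration under stated assumptions rather than a description of the laboratory procedure: the expanded coding sequence is transcribed from the description, Xba I is taken to cut T^CTAGA on the strand read 5' to 3', and `cut_once` is a hypothetical helper.

```python
# Coding strand of the expanded sequence, 5'->3'.
CODING = (
    "AGCTTAAAAAGAATTCCGATAGCGAGT"
    "GTCCTCTGAGTCACGATGGTTACTGTCT"
    "ACATGACGGCGTCTGTATGTATATTGAG"
    "GCTCTAGACAAGTACGCGTGTAATTGCG"
    "TTGTTGGCTACATCGGTGAGCGCTGTCA"
    "GTATCGAGATCTGAAATGGTGGGAACTT"
    "AGATAAG"
)

# Recognition sequence and cut offset within it (T^CTAGA) on the top strand.
XBA_I = ("TCTAGA", 1)


def cut_once(seq, site, offset):
    """Cut `seq` at its single occurrence of `site`, `offset` bases into it."""
    positions = [
        i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)
    ]
    if len(positions) != 1:
        raise ValueError(f"expected exactly one site, found {len(positions)}")
    cut = positions[0] + offset
    return seq[:cut], seq[cut:]


if __name__ == "__main__":
    left, right = cut_once(CODING, *XBA_I)
    print("left-hand portion :", len(left), "nt, ends", left[-6:])
    print("right-hand portion:", len(right), "nt, starts", right[:6])
```

Because the site is unique and roughly central, the cut yields two fragments of similar size, each carrying a CTAG overhang at the Xba I end on the appropriate strand, which is what permits the two-stage cloning described.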

The present invention further relates to a
plasmid recombinant characterised in that it comprises a
plasmid vector having inserted therein at an appropriate
insertion site such a synthetic gene or a sub-unit thereof,
the plasmid recombinant enabling translation in the
correct phase for the mRNA corresponding to the inserted
gene or sub-unit thereof and having a bacterial promoter
upstream of and adjacent to the insertion site such that
the inserted gene or sub-unit thereof is under bacterial
promoter control.

A particular vector, designated pLF 1, has
been specifically designed to afford advantages for the
purposes of the present invention. The inventive plasmid,
pLF 1, is a 5K bp plasmid which may be propagated in
E. coli and which may be constructed from pBR322 and
pUB 110 (see, for example, Gryczan, T.J., et al, (1978),
J. Bacteriol., 134, 318) by inserting the DNA sequence
of the S. aureus plasmid pUB 110 between the EcoRI and
Bam HI sites thereof (comprising approximately 870 bp)
between the EcoRI and Bam HI sites of pBR322, thereby
replacing that region of pBR322.

The present invention further relates to a 
process for the production of such a plasmid recombinant 
characterised in that it comprises inserting such a 
synthetic gene or a sub-unit thereof into an appropriate 
insertion site of an appropriate plasmid vector. 

The following illustrates the present
invention:

Phosphorylation of oligonucleotide blocks: In
each case, 6 µg of oligomer and 60 µCi of [γ-32P]ATP
(>5000 Ci/mmol) were dried and redissolved in a final
buffer concentration of 50 mM Tris-HCl, pH 7.8, 10 mM
MgCl2, 0.25 mM ATP, 10 mM β-mercaptoethanol and incubated
with 4 units (1 unit is the amount that catalyses the
production of 1 nmole of acid-insoluble 32P after
incubation for 30 minutes at 37°C according to Richardson,
C.C., (1972), Progress in Nucleic Acids Research, 2, 815)
of T4 polynucleotide kinase (EC 2.7.1.78, Bethesda Research
Labs) at 37°C for 15 minutes. The enzyme was subsequently
inactivated by a 5 minute incubation at 100°C.

Ligation of oligonucleotide blocks: Except
where indicated above, equimolar quantities (6 to 18 µg)
of oligonucleotide blocks were incubated in 50 mM Tris-
HCl, pH 7.8, 10 mM MgCl2, 1 mM ATP, 20 mM dithiothreitol
(DTT) with 6 units (1 unit is the amount that catalyses
the conversion of 1 nmole of 32PPi into (α/β 32P)-ATP
in 20 minutes at 37°C according to Weiss, B., et al,
(1968), J. Biol. Chem., 243, 4543) of T4 DNA ligase
(EC 6.5.1.1, Bethesda Research Labs) at 25°C for from
3 to 16 hours. Ligated DNA was precipitated by addition
of 2.5 vol absolute ethanol, collected by centrifugation
and redissolved in water.

Purification of ligated species: Ligated
oligonucleotide blocks were electrophoresed on 20% (w/v)
polyacrylamide in 90 mM Tris-HCl, pH 8.3, 90 mM
boric acid, 2.5 mM EDTA (TBE buffer), and the fragments
located by autoradiography. Slices of gel containing
the fragments were excised and the DNA electroeluted
at 1 mA in TBE buffer onto 0.5 ml of DEAE cellulose
(DE52, Whatman) for a few hours. After extensive
washing by 0.1 M NH4OAc, 2 mM Mg(OAc)2, 0.02% (w/v) SDS,
0.02 mM EDTA (AGEB buffer), the DNA was eluted from
the DEAE cellulose by 2 ml of 1.1 M NaCl in AGEB buffer
and precipitated by addition of 2.5 vol. absolute
ethanol.

Construction of chimeric cloning vector pLF 1:
In order to facilitate the two-stage cloning of
urogastrone, it was desirable to construct a vector or
vectors having Hind III, Xba I and Bam HI cleavage
sites, see Figure 9 of the accompanying drawings.
However, no readily available E. coli plasmids possess
Xba I sites. It was surprisingly noticed that the
S. aureus plasmid pUB 110, which cannot be propagated in
E. coli (see, for example, Keggins, K.M., et al, (1978),
Proc. Natl. Acad. Sci. U.S.A., 75, 1423), contains a
region of about 1 kbp of DNA bounded by sites for
EcoRI and Bam HI having an approximately central Xba I
site. Therefore, pUB 110 was cleaved with EcoRI (EC
3.1.4.32) and Bam HI and the DNA fragments electrophoresed
on 5% (w/v) polyacrylamide. The approximately 1 kbp
EcoRI/Bam HI fragment was removed by electroelution from
the excised gel slice onto DEAE cellulose, eluted by
1.1 M NaCl and ethanol precipitated. The E. coli plasmid
pAT 153 (see, for example, Twigg, A.J., and Sherratt,
D., (1980), Nature, 283, 216-218) was also cleaved with
EcoRI and Bam HI, and the 3282 bp fragment purified by
electroelution as above. These two purified DNA fragments
were ligated in equimolar quantities using 10 units of
T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2,
1 mM ATP, 20 mM DTT at 15°C for 18 hours. After ethanol
precipitation, the ligated DNA was dissolved in 100 µl
10 mM Tris pH 7.5, 10 mM MgCl2, 10 mM CaCl2 and
transformed into E. coli K12 HB 101 (genotype gal-, lac-,
ara-, pro-, arg-, str(r), recA-, rk-, mk-; see, for example,
Boyer, H.W., and Roulland-Dussoix, D., J. Mol. Biol.,
41, 459-472) using known methods (see, for example, Cohen,
et al, (1972), Proc. Natl. Acad. Sci. U.S.A., 69, 2110-
2114) and transformants resistant to 100 µg/ml ampicillin
selected. Several transformants were analysed further by
restriction enzyme cleavage of plasmid DNA, and one full
length clone, designated pLF 1, selected for further use
as a cloning vector. At a later stage, an additional
Hind III restriction sequence was created at the EcoRI
site.

Cloning of the synthetic urogastrone gene in
pLF 1: The two portions of the assembled urogastrone
gene were cloned in two transformation stages, as illustrated
in Figure 10 of the accompanying drawings.

Right-hand portion: The longer fragment of
Xba I, Bam HI-cleaved pLF 1 was purified by
electroelution as above and ligated to a large excess of
the right-hand portion assembly of the urogastrone gene
using 10 units T4 DNA ligase in 50 mM Tris-HCl, pH 7.8,
10 mM MgCl2, 1 mM ATP, 20 mM DTT at 15°C for 18 hours.
This was transformed into E. coli as before, with
selection for 100 µg/ml ampicillin. Several transformants
were selected for plasmid analysis by restriction enzyme
cleavage analysis and one clone, designated pUR 1, used
for further cloning. The sequence of the inserted
urogastrone gene DNA was confirmed by chemical degradation
analysis (see, for example, Maxam, A., and Gilbert, W.,
(1977), Proc. Natl. Acad. Sci. U.S.A., 74, 560-4).

In order to have a Hind III site for ligation
to the 5' end of the left-hand portion of the gene,
the EcoRI site was modified as follows: pUR 1 was
cleaved with EcoRI and the resulting recessed ends filled
using 5 units (1 unit is the amount that incorporates
10 nmoles of total nucleotides into an acid-precipitable
fraction in 30 minutes at 37°C using poly-d(A-T) as
primer according to Richardson, C.C., et al, (1964),
J. Biol. Chem., 239, 222) of DNA polymerase I (EC 2.7.7.7)
in 50 mM Tris-HCl, pH 7.5, 10 mM MgCl2, 1 mM β-mercapto-
ethanol with 0.25 mM of each deoxynucleotide for 30 minutes
at 15°C.

To the resulting flush ends was ligated a
large excess of synthetic Hind III linkers (Collaborative
Research) by the known blunt end ligation procedure (see,
for example, Ullrich, A., et al, (1977), Science, 196,
1313-1319). The DNA was ligated using 6 units of T4 DNA
ligase in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 1 mM ATP,
20 mM DTT at 15°C for 18 hours.

After Hind III restriction cleavage and
purification, the full length plasmid was religated and
transformed into E. coli as above, with selection for
100 µg/ml ampicillin. A transformant having a Hind III
site, designated pUR 2, was selected for further cloning.

Left-hand portion: pUR 2 was cleaved with
Hind III and Xba I restriction enzymes and the longer
fragment purified by electroelution as above. This was
ligated to a two-fold molar excess of the assembled
left-hand portion of the urogastrone gene using 6 units
T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2,
1 mM ATP, 20 mM DTT for 18 hours at 15°C. This was
transformed into E. coli K12 MRC 8 (genotype dap 103
hsd R met B1 glm 533 upp 1 dap 101 sup E thy A 103 deo
rec A1) with selection for 100 µg/ml ampicillin. Several
transformants were selected for plasmid analysis by
restriction enzyme cleavage. One clone, designated pUR I,
was used for further characterisation and cloning. The
sequence of the full urogastrone gene was confirmed
by chemical degradation analysis. It should be noted
that pUR I has no remaining S. aureus DNA sequence
present.

The present invention also relates to a cell, 
in particular an E. coli cell, characterised in that it 
comprises inserted therein such a synthetic gene or a sub- 
unit thereof or such a plasmid recombinant. 

The present invention further relates to a 
process for the production of such a cell characterised 
in that it comprises inserting such a synthetic gene or 
a sub-unit thereof or such a plasmid recombinant into 
a cell. 

Expression of urogastrone in E. coli: The
urogastrone gene insert was cleaved from pUR I by Hind
III and Bam HI cleavage and purified by polyacrylamide
gel electrophoresis and electroelution as above. This
fragment was ligated to Hind III, Bam HI-cleaved pWT 121
and pWT 221 (see, for example, Tacon, W.C.A., et al,
(1980), Molec. Gen. Genet., 177, 427) and the recombinant
molecules used to transform E. coli MRC 8 (see, for
example, Emtage, J.S., et al, (1980), Nature, 283, 171-
174), with selection for 100 µg/ml ampicillin. Trans-
formants containing full length urogastrone genes were
characterised by restriction enzyme cleavage analysis and
DNA purified by isopycnic centrifugation in caesium
chloride.

Expression of urogastrone-like fusion protein
was induced by growth of cells in L-broth (Luria broth:
1% (w/v) bacto tryptone, 0.5% (w/v) bacto yeast extract,
0.5% (w/v) NaCl, 0.2% (w/v) glucose, 0.004% (w/v)
thymine, pH 7) containing 100 µg/ml ampicillin to an
A600 nm of 0.3. Following centrifugation, the cells
were washed and resuspended in M9 medium lacking
tryptophan, but containing 20 µg/ml 3β-indole acrylic
acid. The cells were incubated at 37°C for 4 hours.
Under these conditions maximal tryptophan promoter
activity is known to occur (see Tacon, et al, loc cit),
and hence expression of the urogastrone fusion protein.

The present invention also relates to a
process for the production of urogastrone or a sub-
unit thereof characterised in that it comprises
culturing such a cell and recovering expressed protein.

As mentioned above, urogastrone has an
application in the treatment of ulcers, and also in other
instances where the growth promoting activity thereof
would be beneficial, for example, in wound healing.
Conventional administration forms may be used, the active
material being used in an effective amount, for example
from 0.1 to 1.0 µg/kg body weight, preferably from 0.125
to 0.5 µg/kg body weight, more preferably about 0.25
µg/kg body weight, optimally together with a
conventional pharmaceutically-acceptable carrier, diluent
or adjuvant.
CLAIMS:

1. A synthetic gene characterised in that it codes
for the expression of urogastrone or a sub-unit thereof.

2. A synthetic gene as claimed in claim 1
characterised in that it comprises the following sequence
or a sub-unit thereof:

Coding:     AATTCCGATAGCGAGTGTCCTCTG-
Non-coding: TTAAGGCTATCGCTCACAGGAGAC-

            AGTCACGATGGTTACTGTCTACATGACGGC-
            TCAGTGCTACCAATGACAGATGTACTGCCG-

            GTCTGTATGTATATTGAGGCTCTAGACAAG-
            CAGACATACATATAACTCCGAGATCTGTTC-

            TACGCGTGTAATTGCGTTGTTGGCTACATC-
            ATGCGCACATTAACGCAACAACCGATGTAG-

            GGTGAGCGCTGTCAGTATCGAGATCTGAAA-
            CCACTCGCGACAGTCATAGCTCTAGACTTT-

            TGGTGGGAACTTAGA
            ACCACCCTTGAATCT

3. A synthetic gene as claimed in claim 1 or claim 2
characterised in that it comprises the following sequence
or a sub-unit thereof:

Coding:     AGCTTAAAAAGAATTCCGATAGCGAGT-
Non-coding:     ATTTTTCTTAAGGCTATCGCTCA-

            GTCCTCTGAGTCACGATGGTTACTGTCT-
            CAGGAGACTCAGTGCTACCAATGACAGA-
            ACATGACGGCGTCTGTATGTATATTGAG-
            TGTACTGCCGCAGACATACATATAACTC-
            GCTCTAGACAAGTACGCGTGTAATTGCG-
            CGAGATCTGTTCATGCGCACATTAACGC-
            TTGTTGGCTACATCGGTGAGCGCTGTCA-
            AACAACCGATGTAGCCACTCGCGACAGT-
            GTATCGAGATCTGAAATGGTGGGAACTT-
            CATAGCTCTAGACTTTACCACCCTTGAA-
            AGATAAG
            TCTATTCCTAG

4. A process for the production of a synthetic gene
or a sub-unit thereof as claimed in any of claims 1 to 3
characterised in that it comprises the assembly and
ligation of a number of oligonucleotide blocks.

5. A plasmid recombinant characterised in that it
comprises a plasmid vector having inserted therein at an 
appropriate insertion site a synthetic gene or a sub-unit 
thereof as claimed in any of claims 1 to 3, the plasmid 
recombinant enabling translation in the correct phase for 
the mRNA corresponding to the inserted gene or sub-unit 
thereof and having a bacterial promoter upstream of and 
adjacent to the insertion site such that the inserted gene 
or sub-unit thereof is under bacterial promoter control. 

6. A plasmid recombinant as claimed in claim 5
characterised in that it comprises as the plasmid vector
pLF 1.

7. A process for the production of a plasmid
recombinant as claimed in claim 5 characterised in that it
comprises inserting a synthetic gene or a sub-unit thereof
as claimed in any of claims 1 to 3 into an appropriate
insertion site of an appropriate plasmid vector.

8. A cell characterised in that it comprises
inserted therein a synthetic gene or a sub-unit thereof
as claimed in any of claims 1 to 3 or a plasmid recombinant
as claimed in claim 5 or claim 6.

9. A process for the production of a cell as claimed
in claim 8 characterised in that it comprises inserting a
synthetic gene or a sub-unit thereof as claimed in any of
claims 1 to 3 or a plasmid recombinant as claimed in claim
5 or claim 6 into a cell.

10. A process for the production of urogastrone or
a sub-unit thereof characterised in that it comprises
culturing a cell as claimed in claim 8 and recovering
expressed protein.
Fig. 1 (Part 1 of 2)

                            -10                 -5
            Met Gln Thr Gln Lys Pro Thr Pro Ser Ser Lys
Coding      ATG CAA ACA CAA AAA CCG ACT CCA AGC TCC AAG
Non-coding                                          TTC

                        1               5
            Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu
Coding      CTT AAA AAG AAT TCC GAT AGC GAG TGT CCT CTG
Non-coding  GAA TTT TTC TTA AGG CTA TCG CTC ACA GGA GAC

                10                  15
            Ser His Asp Gly Tyr Cys Leu His Asp Gly Val
Coding      AGT CAC GAT GGT TAC TGT CTA CAT GAC GGC GTC
Non-coding  TCA GTG CTA CCA ATG ACA GAT GTA CTG CCG CAG

            20                  25                  30
            Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala
Coding      TGT ATG TAT ATT GAG GCT CTA GAC AAG TAC GCG
Non-coding  ACA TAC ATA TAA CTC CGA GAT CTG TTC ATG CGC

Fig. 1 (Part 2 of 2)

                            35                  40
            Cys Asn Cys Val Val Gly Tyr Ile Gly Glu Arg
Coding      TGT AAT TGC GTT GTT GGC TAC ATC GGT GAG CGC
Non-coding  ACA TTA ACG CAA CAA CCG ATG TAG CCA CTC GCG

                        45                  50
            Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu Leu
Coding      TGT CAG TAT CGA GAT CTG AAA TGG TGG GAA CTT
Non-coding  ACA GTC ATA GCT CTA GAC TTT ACC ACC CTT GAA

                    55
            Arg Ter Gly Ser
Coding      AGA TAA GGA TCC
Non-coding  TCT ATT CCT AGG
Fig. 2 (Parts 1 and 2 of 2): the sequence of Fig. 1 annotated with restriction
enzyme recognition sites: Hind III, Alu I, EcoRI, Mnl I, Hinf I, Acc I, Hga I,
Xba I, FnuD II, Hha I, Hph I, Hae II, Mbo I, Taq I, Bgl II and Bam HI.
Fig. 3 (Parts 1 and 2 of 2): division of the expanded sequence into the 23
synthetic oligonucleotide blocks, odd-numbered blocks on the coding strand and
even-numbered blocks on the non-coding strand (length in nucleotides in
parentheses): I (13), II (15), III (18), IV (18), V (18), VI (18), VII (18),
VIII (18), IX (12), X (18), XI (12), XII (20), XIII (18), XIV (16), XV (16),
XVI (12), XVII (13), XVIII (14), XIX (15), XX (14), XXI (12), XXII (12),
XXIII (14).
Fig. 4 (Parts 1 and 2 of 2): block layout as in Fig. 3, with blocks of modified
sequence near the centre of the molecule marked VII* (18), VIII* (18) and
X' (18). The modified rows read:

            Ser His Asp Gly Tyr Cys Leu His Asp Gly Val
Coding      AGT CAC GAT GGT TAC TGT CTA CAT GAC GGT GTC
Non-coding  TCA GTG CTA CCA ATG ACA GAT GTA CTG CCA CAG

            Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala
Coding      TGT ATG TAT ATC GAA GCT CTA GAC AAG TAC GCG
Non-coding  ACA TAC ATA TAG CTT CGA GAT CTG TTC ATG CGC
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T + C 

I 

T.C C+C T ♦ T 

I I 



C C.C + A T.T + T 




.C.C.C.A C.C.A.T.T.T 




A.G.T.T.C..C.C.A.C.C.A.T.T.T 

* 

Fig. 5 
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DMTr 0 



0 N 




HO 



0 iN 



CI 



triethylamine 
in pyridine 



2°/o benzene sulphonic 
acid in 

chloroform /methanol 




t 



DMTr 



— I 0 N 

v 



g-^~^-o— p=o 



Fig. 7 



condensing agents 



DMTr NpNp-OCHpCHpCN 
II 
0 0 

Column 



repeat cycle 



NNN 
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Column 
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hand portion 
3 5 



11 



(sub-molar) 

( sub- m olar) 

8 




8 ^ 10 
(some dimerisation) 



3 * 5 . 7 A 9 . 11 



1 



V 



HindJE 
Xba I 



^ 7 „ 9 



2 v A v 6 v 8 ^~10 



Fig. 8 (Part 1of 2) 
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Right-hand portion 



13 



15 



17 



12 



14 

i 



12_^J3 



13 v 12 



(dimer) 



14 




17 15 



19 



21 



23 



16 



18 



19 



16 18 




20 



22 



23_^22 „ 20 



20 * 22 * 23 
(dimer) 




19 . 21 * 23 a 22 a 20 a 18 a 16 




16 18 20 22 23 21* 19 



14 , 12 . 13 . 15 . 17 , 19 , 3 

. 17 w 15 w 13 * 12 * 14 * 16 w 18 ' 



19 a 21 a 23 a 22 



20 



18 v ~20 



Xba I 
Bam HI 



22 * 23 v 21 " 19 





12 "^H^ 16 



18 """aP' 22 



Fig.8 ( Part 2 of 2 ) 
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RI 



Bam HI 
Gel purify 
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transform 
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Xbal 




RI 
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Gel purify 




Bam HI 



Bam HI 



Fig.9 
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Description 

This invention relates to a synthetic urogastrone gene, to corresponding plasmid recombinants and
transformed cells, to the production thereof and to urogastrone expression.

Urogastrone is a polypeptide hormone (protein) synthesised in the duodenum and in the salivary
glands of normal humans (see, for example, Heitz, et al, (1978), Gut, 19, 408-413). Urogastrone
suppresses the secretion of gastric acid and promotes cell growth (see, for example, Elder, et al, (1975),
Gut, 16, 887-893). Therefore, it has an application in the treatment of ulcers and in the promotion of
wound healing. Urogastrone is excreted in small amounts in human urine and may be isolated therefrom.
There exists, however, a need for a more viable commercial production thereof and such is provided
according to the present invention.

Urogastrone is known to consist of 53 amino acids in the following sequence: 

Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp-
Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr-
Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys-
Val Val Gly Tyr Ile Gly Glu Arg Cys Gln Tyr-
Arg Asp Leu Lys Trp Trp Glu Leu Arg

(see, for example, Gregory, H., and Preston, B. M., (1977), Int. J. Peptide Protein Res., 9, 107-118).

From the above amino acid sequence, a corresponding synthetic gene sequence has been invented,
subject to a number of specific non-obvious criteria, and oligonucleotide blocks synthesised which, when
assembled, form a synthetic gene coding for urogastrone. The blocks have been hybridised and ligated in
pre-determined stages to construct the urogastrone gene in two portions. These have been cloned in two
operations into a new specifically-designed chimeric E. coli/S. aureus vector so as to produce a full length
urogastrone gene flanked only by E. coli plasmid DNA. The gene has been excised from this recombinant
and re-cloned into vectors specifically designed to maximise expression of the gene in E. coli, under the
control of the promoter obtained from the E. coli tryptophan operon. A protein resembling human
urogastrone has thus been expressed in E. coli.

From the above amino acid sequence, because of the degeneracy of the genetic code, it is possible to
predict numerous nucleotide sequences which would code for the protein.
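The scale of "numerous" can be estimated with a short calculation. The following is a minimal sketch, assuming the standard genetic code and ignoring stop codons and flanking sequence; the names `DEGENERACY` and `count_encodings` are illustrative and do not come from the specification.

```python
from collections import Counter
from math import prod

# Standard genetic code; codons are enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Number of synonymous codons available for each amino acid.
DEGENERACY = Counter(CODON_TABLE.values())

# The 53-residue urogastrone sequence given above, in one-letter code.
UROGASTRONE = "NSDSECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELR"


def count_encodings(protein: str) -> int:
    """Number of distinct nucleotide sequences that encode the given protein."""
    return prod(DEGENERACY[aa] for aa in protein)


print(len(UROGASTRONE), "residues")
print(f"{count_encodings(UROGASTRONE):.2e} possible coding sequences")
```

The count is simply the product of the per-residue codon choices, which is why selection criteria such as those described below are needed to settle on one sequence.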

In the inventive determination of an optimum sequence from the large number of possibilities, several
non-obvious criteria have been observed. Firstly, trinucleotide codons should be used which are acceptable
or preferable in the cells to be used, in particular E. coli. Secondly, it was decided that it was desirable to
have different restriction enzyme recognition sites at the termini of the molecule so as to allow insertion
into a plasmid in a desired orientation. Moreover, it was decided to select sites which allowed the use of
well-understood cloning vectors, such as pBR322 (see, for example, Bolivar, F., et al, (1977), Gene, 2,
95-113). In fact, Hind III and Bam HI sites were selected and introduced at the 5' and 3' ends, respectively.
Thirdly, it was thought desirable to introduce a series of restriction endonuclease recognition sites
strategically placed along the molecule to enable the gene to be specifically dissected to aid characterisation
and, possibly, mutagenesis. Also, this measure allowed the two portions of the molecule to be cloned in
stages. In particular, an Xba I site was introduced at a central location in the gene. Fourthly, the synthesis
should not be unnecessarily complicated and illegitimate cross-hybridisations should be minimised in
order to facilitate gene assembly. Using a computer, energies of interactions arising from all possible
approximations were calculated (see, for example, Tinoco, Jr., I., et al, (1971), Nature, 230, 362-367;
Powers, G. J., et al, (1975), JACS, 97, 875-889), so that stable off-diagonal interactions might be avoided
where possible. Fifthly, since the protein ultimately expressed in bacterial cells will be in the form of a
fusion product, it was desirable to have a means of cleaving the urogastrone portion from such a fusion
product. Since urogastrone is known to be insensitive to trypsin (see, for example, Gregory and Preston,
loc cit), the codons specifying the dipeptide Lys-Lys were introduced near the end of the gene corresponding
to the urogastrone N-terminus in order to serve as a substrate for tryptic proteolysis.
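The fourth criterion can be illustrated with a much cruder screen than the energy calculations cited. The sketch below is not the Tinoco or Powers method: it only reports the longest perfectly base-paired antiparallel stretch between two oligonucleotides, and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_pairing(a: str, b: str) -> int:
    """Length of the longest stretch of `a` that can pair perfectly with `b`.

    A stretch of `a` pairs with `b` in antiparallel orientation exactly when it
    occurs in the reverse complement of `b`, so this is a longest common
    substring search between `a` and reverse_complement(b).
    """
    target = reverse_complement(b)
    best = 0
    prev = [0] * (len(target) + 1)
    for base_a in a:
        cur = [0] * (len(target) + 1)
        for j, base_t in enumerate(target, start=1):
            if base_a == base_t:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


# Intended overlap: a stretch of the coding strand against the tetradecanucleotide
# AGTTCCCACCATTT taken from the non-coding strand.
print(longest_pairing("GAAATGGTGGGAACTT", "AGTTCCCACCATTT"))
# Palindromic restriction sites pair with themselves, one source of unwanted dimers.
print(longest_pairing("GGATCC", "GGATCC"))
```

Running such a screen over every pair of blocks would flag long unintended matches; ranking them by stability would need the energy rules in the cited papers.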

The present invention relates to a synthetic gene characterised in that it codes for the expression of 
urogastrone or an equivalent thereof or a sub-unit thereof in a bacterial cell and comprises the following 
sequence or sub-unit thereof: 

5' AATTCCGATAGCGAGTGTCCTCTG-
3' TTAAGGCTATCGCTCACAGGAGAC-
   AGTCACGATGGTTACTGTCTACATGACGG-
   TCAGTGCTACCAATGACAGATGTACTGCC-
   CGTCTGTATGTATATTGAGGCTCTAGACA-
   GCAGACATACATATAACTCCGAGATCTGT-
   AGTACGCGTGTAATTGCGTTGTTGGCTAC-
   TCATGCGCACATTAACGCAACAACCGATG-
   ATCGGTGAGCGCTGTCAGTATCGAGATCT-
   TAGCCACTCGCGACAGTCATAGCTCTAGA-
   GAAATGGTGGGAACTTAGA
   CTTTACCACCCTTGAATCT
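The sequence above can be checked against the published amino acid sequence. This is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first nucleotide of the coding strand; the helper names are illustrative.

```python
# Standard genetic code; codons are enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO))

# Coding strand, written 5'->3', joined from the lines printed above.
CODING = (
    "AATTCCGATAGCGAGTGTCCTCTG"
    "AGTCACGATGGTTACTGTCTACATGACGG"
    "CGTCTGTATGTATATTGAGGCTCTAGACA"
    "AGTACGCGTGTAATTGCGTTGTTGGCTAC"
    "ATCGGTGAGCGCTGTCAGTATCGAGATCT"
    "GAAATGGTGGGAACTTAGA"
)
# Non-coding strand, written 3'->5' exactly as printed above.
NON_CODING = (
    "TTAAGGCTATCGCTCACAGGAGAC"
    "TCAGTGCTACCAATGACAGATGTACTGCC"
    "GCAGACATACATATAACTCCGAGATCTGT"
    "TCATGCGCACATTAACGCAACAACCGATG"
    "TAGCCACTCGCGACAGTCATAGCTCTAGA"
    "CTTTACCACCCTTGAATCT"
)
# Published 53-residue urogastrone sequence in one-letter code.
UROGASTRONE = "NSDSECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELR"


def translate(dna: str) -> str:
    """Translate complete codons from the first base; a trailing partial codon is dropped."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def complement(dna: str) -> str:
    """Base-by-base complement, without reversing the sequence."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))


print(len(CODING), "nucleotides")
print("encodes urogastrone:", translate(CODING) == UROGASTRONE)
print("strands complementary:", complement(CODING) == NON_CODING)
```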






In fact, bearing in mind, inter alia, the above-mentioned considerations, for convenience, a slightly 
longer sequence was selected which is as follows: 

Coding     5' AGCTTAAAAAGAATTCCGATAGCGAGT-
Non-coding 3'     ATTTTTCTTAAGGCTATCGCTCA-
              GTCCTCTGAGTCACGATGGTTACTGTCT-
              CAGGAGACTCAGTGCTACCAATGACAGA-
              ACATGACGGCGTCTGTATGTATATTGAG-
              TGTACTGCCGCAGACATACATATAACTC-
              GCTCTAGACAAGTACGCGTGTAATTGCG-
              CGAGATCTGTTCATGCGCACATTAACGC-
              TTGTTGGCTACATCGGTGAGCGCTGTCA-
              AACAACCGATGTAGCCACTCGCGACAGT-
              GTATCGAGATCTGAAATGGTGGGAACTT-
              CATAGCTCTAGACTTTACCACCCTTGAA-
              AGATAAG
              TCTATTCCTAG

The whole of this expanded sequence is shown in Figure 1 of the accompanying drawings and the
restriction enzyme recognition sites are shown in Figure 2 of the accompanying drawings.
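The features of the expanded sequence described in the text can be located with a few string operations. This is a sketch under stated assumptions: the recognition sequences GAATTC (EcoRI) and TCTAGA (Xba I) are the commonly documented ones, the reading frame is taken from the Leu Lys Lys prefix shown in the figures, and the helper names are hypothetical.

```python
# Standard genetic code; codons are enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO))

# Coding strand of the expanded sequence, 5'->3', joined from the lines above.
CODING = (
    "AGCTTAAAAAGAATTCCGATAGCGAGT"
    "GTCCTCTGAGTCACGATGGTTACTGTCT"
    "ACATGACGGCGTCTGTATGTATATTGAG"
    "GCTCTAGACAAGTACGCGTGTAATTGCG"
    "TTGTTGGCTACATCGGTGAGCGCTGTCA"
    "GTATCGAGATCTGAAATGGTGGGAACTT"
    "AGATAAG"
)
UROGASTRONE = "NSDSECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELR"


def find_sites(seq: str, site: str) -> list[int]:
    """Zero-based start positions of every occurrence of a recognition sequence."""
    return [i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)]


def translate(dna: str) -> str:
    """Translate complete codons from the first base; a trailing partial codon is dropped."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print("EcoRI sites:", find_sites(CODING, "GAATTC"))
print("Xba I sites:", find_sites(CODING, "TCTAGA"))
# The 5' end carries the single-stranded AGCT of a cleaved Hind III site (A^AGCTT).
print("Hind III cohesive end:", CODING.startswith("AGCTT"))
# Reading from the third nucleotide gives Leu Lys Lys, urogastrone, then a stop codon.
print(translate(CODING[2:]))
```

The Lys-Lys dipeptide in the translation is the tryptic cleavage site introduced under the fifth criterion, and the single Xba I site is what allows the gene to be cloned as two portions.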

The present invention also relates to a process for the production of the above-said synthetic gene or a
sub-unit thereof characterised in that it comprises the assembly and ligation of a number of
oligonucleotide blocks.

Such "blocks" should not be regarded as sub-units of the gene. In the present context, "sub-unit"
refers to a sequence which is less than the whole, but which exhibits the desired properties.

It was in fact decided to synthesise a molecule having the above expanded sequence by making 23 
synthetic oligonucleotide blocks as illustrated in Figure 3 of the accompanying drawings, which will 
assemble by single-strand overlaps to give the complete double-stranded nucleotide sequence. 

In order to minimise strong illegitimate interactions near the centre of the molecule during assembly,
certain blocks of modified sequence have also been synthesised, as illustrated in Figure 4 of the
accompanying drawings.

As mentioned above, the synthetic blocks selected are shown in Figure 3 of the accompanying
drawings. The blocks may be constructed using known synthesis techniques (see, for example, Agarwal, et
al, (1970), Nature, 227, 27-34; and Crea, et al, (1978), Proc. Natl. Acad. Sci. U.S.A., 75, 5765-5769).
The synthetic methods will now be illustrated with reference to the synthesis of the
tetradecanucleotide ApGpTpTpCpCpCpApCpCpApTpTpT.
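The position of this tetradecanucleotide within the gene can be found by comparing it with the expanded coding strand. The sketch below is illustrative only; it assumes the oligonucleotide is written 5' to 3' and uses hypothetical helper names.

```python
# Coding strand of the expanded sequence, 5'->3', as given earlier in the description.
CODING = (
    "AGCTTAAAAAGAATTCCGATAGCGAGT"
    "GTCCTCTGAGTCACGATGGTTACTGTCT"
    "ACATGACGGCGTCTGTATGTATATTGAG"
    "GCTCTAGACAAGTACGCGTGTAATTGCG"
    "TTGTTGGCTACATCGGTGAGCGCTGTCA"
    "GTATCGAGATCTGAAATGGTGGGAACTT"
    "AGATAAG"
)
# ApGpTpTpCpCpCpApCpCpApTpTpT written without the phosphate symbols.
TETRADECA = "AGTTCCCACCATTT"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


on_coding = TETRADECA in CODING
on_non_coding = reverse_complement(TETRADECA) in CODING
print("found on coding strand:", on_coding)
print("found on non-coding strand:", on_non_coding)
if on_non_coding:
    start = CODING.find(reverse_complement(TETRADECA))
    print("pairs with coding-strand stretch:", CODING[start:start + len(TETRADECA)])
```

The oligonucleotide matches the non-coding strand, pairing with the stretch of the coding strand that encodes Lys Trp Trp Glu near the 3' end of the gene.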

The methods of building up oligonucleotides from smaller units by successive coupling reactions are
well known (see, for example, Hsiung, et al, (1979), Nucleic Acids Research, 6, 1371-1385). The completely
protected tetradecanucleotide was built up as shown in Figure 5 of the accompanying drawings (wherein,
for convenience, protecting groups are not shown).

The condensation reactions indicated by arrows in Figure 5 were carried out by the following 
procedure exemplified in the synthesis of: 

    DMTr A p G(iBu) p-CNEt
         |        |
        φCl      φCl

as shown in Figure 6 of the accompanying drawings. From 1.1 to 1.5 mmole of the 3'-phosphodiester
component (I) was condensed with 1.0 mmole of the 5'-hydroxyl component (II) in anhydrous pyridine in
the presence of from 3 to 4.5 mmoles of 2,4,6-triisopropyl-benzene-sulphonyl-tetrazolide. The reaction was
left for 1 hour at room temperature or until chromatography on silica TLC plates eluted with 10% (v/v)
methanol in chloroform showed that the hydroxyl component was exhausted. The reaction was quenched
with 5% (w/v) sodium bicarbonate solution and extracted with chloroform. The chloroform extract was
dried and loaded onto a reverse phase chromatography column (ODS bonded to 15-25 micron silica). The
fully protected dinucleotide product (III) was eluted with a solvent gradient from
chloroform:methanol:water (2:6:3 v/v) to chloroform:methanol:water (2:6:0.5 v/v). The product (III) was
extracted into chloroform and dried. The final isolated yield was 81%.

To proceed to further condensations, the terminal protecting group (DMTr or CNEt) was removed
selectively using triethylamine in pyridine (CNEt) or a 2% (w/v) solution of benzene sulphonic acid in
chloroform:methanol (DMTr) as shown in Figure 7 of the accompanying drawings.

At the completion of the synthesis, all of the protecting groups were removed by sequential treatment
with 0.1 M tetraethylammonium fluoride in THF/pyridine/water (8:1:1 v/v), ammonia and 80% acetic acid.
The deprotected oligonucleotides were purified by ion-exchange HPLC and sequence analysis was carried
out by the method of Wu, et al, (1976), Anal. Biochem., 74, 73-93.

The oligomeric blocks of nucleotides were hybridised and ligated (see, for example, Agarwal, et al, loc
cit) in a series of steps, in order to minimise the possibilities for undesirable interactions, leading to the
formation of the two portions as shown in Figure 8 of the accompanying drawings. The order of the
additions in the assembly scheme was optimised for minimal incorrect ligations and in the case of
especially difficult oligomeric blocks, notably 7 and 8, sub-molar quantities were used in order to remove
all monomeric units before further additions were made.

In more detail:

Left-hand portion: Blocks 1 and 2 were ligated to form a dimer about the Hind III site. Blocks 4, 5 and 6
and 8, 9 and 10 were also ligated in the first round of the assembly scheme. Molar equivalents were used
for all but block 8 where 0.75 molar equivalents were employed.

Block 3 was ligated with 1+2, block 7 (0.75 molar equivalent) with 4 to 6 and block 11 with 8 to 10. The
8+9+10+11 assembly has one flush end, hence some blunt-end dimerisation was observed.

1 to 3 and 4 to 7 were ligated and finally 8 to 11 were ligated to the resulting 1 to 7 species.

The dimeric 1 to 11 left-hand portion was then cleaved by Hind III (EC 3.1.23.21) and Xba I (EC 3.1.23.4)
to generate the monomeric left-hand portion, with the correct cohesive termini to allow construction of
recombinant plasmids.

Right-hand portion: Blocks 12 and 13 were ligated to form a dimer about the Xba I site and blocks 20, 23
and 22 similarly ligated to form a dimer about the Bam HI site. Blocks 14, 15 and 17, and 16, 18 and 19 were
also ligated at this stage.

The 12, 13 dimer and the 14, 15, 17 assembly were ligated, as were the 16, 18, 19 assembly with the 20,
22, 23 dimer, where block 21 was used as a joining section.

These two species were then ligated to give an oligomeric molecule which was cleaved by Xba I and
Bam HI (EC 3.1.23.6) to give the monomer 12 to 23 species having the correct cohesive termini to allow
construction of recombinant plasmids.

The present invention further relates to a plasmid recombinant characterised in that it comprises a
plasmid vector having inserted therein at an appropriate insertion site a synthetic gene or a sub-unit
thereof as defined above, the plasmid recombinant enabling translation in the correct phase for the mRNA
corresponding to the inserted gene or sub-unit thereof and having a bacterial promoter upstream of and
adjacent to the insertion site such that the inserted gene or sub-unit thereof is under bacterial promoter
control.

A particular vector, designated pLF1, has been specifically designed to afford advantages for the
purposes of the present invention. The inventive plasmid pLF1 is a 5K bp plasmid which may be
propagated in E. coli and which may be constructed from pBR322 and pUB110 (see, for example, Gryczan,
T. J., et al, (1978), J. Bacteriol., 134, 318) by inserting the DNA sequence of the S. aureus plasmid pUB110
between the EcoRI and Bam HI sites thereof (comprising approximately 870 bp) between the EcoRI and
Bam HI sites of pBR322, thereby replacing that region of pBR322.

The present invention further relates to a process for the production of such a plasmid recombinant 
characterised in that it comprises inserting such a synthetic gene or a sub-unit thereof as defined above 
into an appropriate insertion site of an appropriate plasmid vector. 
The following illustrates the present invention: 

Phosphorylation of oligonucleotide blocks: In each case, 6 µg of oligomer and 60 µCi of [γ-32P]ATP
(>5000 Ci/mMol) were dried and redissolved in a final buffer concentration of 50 mM Tris-HCl, pH 7.8, 10
mM MgCl2, 0.25 mM ATP, 10 mM β-mercaptoethanol and incubated with 4 units (1 unit is the amount that
catalyses the production of 1 nmole of acid-insoluble 32P after incubation for 30 minutes at 37°C according
to Richardson, C.C., (1972), Progress in Nucleic Acids Research, 2, 815) of T4 polynucleotide kinase (EC
2.7.1.78, Bethesda Research Labs) at 37°C for 15 minutes. The enzyme was subsequently inactivated by a 5
minute incubation at 100°C.

Ligation of oligonucleotide blocks: Except where indicated above, equimolar quantities (6 to 18 µg) of
oligonucleotide blocks were incubated in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 1 mM ATP, 20 mM
dithiothreitol (DTT) with 6 units (1 unit is the amount that catalyses the conversion of 1 nmole of 32PPi into
(α/β 32P)-ATP in 20 minutes at 37°C according to Weiss, B., et al, (1968), J. Biol. Chem., 243, 4543) of T4 DNA
ligase (EC 6.5.1.1, Bethesda Research Labs) at 25°C for from 3 to 16 hours. Ligated DNA was precipitated by
addition of 2.5 vol absolute ethanol, collected by centrifugation and redissolved in water.

Purification of ligated species: Ligated oligonucleotide blocks were electrophoresed on 20% (w/v)
polyacrylamide in 90 mM Tris-HCl, pH 8.3, 90 mM boric acid, 2.5 mM EDTA (TBE buffer), and the fragments
located by autoradiography. Slices of gel containing the fragments were excised and the DNA electroeluted
at 1 mA in TBE buffer onto 0.5 ml of DEAE cellulose (DE52, Whatman) for a few hours. After extensive
washing by 0.1 M NH4OAc, 2 mM Mg(OAc)2, 0.02% (w/v) SDS, 0.02 mM EDTA (AGEB buffer), the DNA was
eluted from the DEAE cellulose by 2 ml of 1.1 M NaCl in AGEB buffer and precipitated by addition of 2.5 vol
absolute ethanol.

Construction of chimeric cloning vector pLF1: In order to facilitate the two-stage cloning of
urogastrone, it was desirable to construct a vector or vectors having Hind III, Xba I and Bam HI cleavage
sites, see Figure 9 of the accompanying drawings. However, no readily available E. coli plasmids possess
Xba I sites. It was surprisingly noticed that the S. aureus plasmid pUB110, which cannot be propagated in
E. coli (see, for example, Keggins, K. M., et al, (1978), Proc. Natl. Acad. Sci. U.S.A., 75, 1423), contains a
region of about 1 kbp of DNA bounded by sites for EcoRI and Bam HI having an approximately central Xba I
site.

Therefore, pUB110 was cleaved with EcoRI (EC 3.1.4.32) and Bam HI and the DNA fragments
electrophoresed on 5% (w/v) polyacrylamide. The approximately 1 kbp EcoRI/Bam HI fragment was
removed by electroelution from the excised gel slice onto DEAE cellulose, eluted by 1.1 M NaCl and ethanol
precipitated. The E. coli plasmid pAT153 (see, for example, Twigg, A. J., and Sherratt, D., (1980), Nature,
283, 216-218) was also cleaved with EcoRI and Bam HI, and the 3282 bp fragment purified by
electroelution as above. These two purified DNA fragments were ligated in equimolar quantities using 10
units of T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 1 mM ATP, 20 mM DTT at 15°C for 18 hours.
After ethanol precipitation, the ligated DNA was dissolved in 100 µl 10 mM Tris pH 7.5, 10 mM MgCl2, 10
mM CaCl2 and transformed into E. coli K12 HB101 (genotype gal-, lac-, ara-, pro-, arg-, strr, rec A-, rk-, mk-;
see, for example, Boyer, H. W., and Roulland-Dussoix, D., J. Mol. Biol., 41, 459-472) using known methods
(see, for example, Cohen, et al, (1972), Proc. Natl. Acad. Sci. U.S.A., 69, 2110-2114) and transformants
resistant to 100 µg/ml ampicillin selected. Several transformants were analysed further by restriction
enzyme cleavage of plasmid DNA, and one full length clone, designated pLF1, selected for further use as a
cloning vector. At a later stage, an additional Hind III restriction sequence was created at the EcoRI site.

Cloning of the synthetic urogastrone gene in pLF1: The two portions of the assembled urogastrone
gene were cloned in two transformation stages, as illustrated in Figure 10 of the accompanying drawings.

Right-hand portion: The longer fragment of Xba I, Bam HI-cleaved pLF1 was purified by electroelution
as above and ligated to a large excess of the right-hand portion assembly of the urogastrone gene using 10
units T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 1 mM ATP, 20 mM DTT at 15°C for 18 hours.
This was transformed into E. coli as before, with selection for 100 µg/ml ampicillin. Several transformants
were selected for plasmid analysis by restriction enzyme cleavage analysis and one clone, designated
pUR1, used for further cloning. The sequence of the inserted urogastrone gene DNA was confirmed by
chemical degradation analysis (see, for example, Maxam, A., and Gilbert, W., (1977), Proc. Natl. Acad. Sci.
U.S.A., 74, 560-564).

In order to have a Hind III site for ligation to the 5' end of the left-hand portion of the gene, the EcoRI
site was modified as follows: pUR1 was cleaved with EcoRI and the resulting recessed ends filled using 5
units (1 unit is the amount that incorporates 10 nmoles of total nucleotides into an acid-precipitable
fraction in 30 minutes at 37°C using poly-d(A-T) as primer according to Richardson, C.C., et al, (1964), J.
Biol. Chem., 239, 222) of DNA polymerase I (EC 2.7.7.7) in 50 mM Tris-HCl, pH 7.5, 10 mM MgCl2, 1 mM
β-mercaptoethanol with 0.25 mM of each deoxynucleotide for 30 minutes at 15°C.

To the resulting flush ends were ligated a large excess of synthetic Hind III linkers (Collaborative
Research) by the known blunt end ligation procedure (see, for example, Ullrich, A., et al, (1977), Science,
196, 1313-1319). The DNA was ligated using 6 units of T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM
MgCl2, 1 mM ATP, 20 mM DTT at 15°C for 18 hours.

After Hind III restriction cleavage and purification, the full length plasmid was religated and
transformed into E. coli as above, with selection for 100 µg/ml ampicillin. A transformant having a Hind III
site, designated pUR2, was selected for further cloning.

Left-hand portion: pUR2 was cleaved with Hind III and Xba I restriction enzymes and the longer
fragment purified by electroelution as above. This was ligated to a two-fold molar excess of the assembled
left-hand portion of the urogastrone gene using 6 units T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM
MgCl2, 1 mM ATP, 20 mM DTT for 18 hours at 15°C. This was transformed into E. coli K12 MRC 8 (genotype
dap 103 hsd R met Bl glm 533 upp 1 dap 101 sup E thy A 103 deo rec A1) with selection for 100 µg/ml
ampicillin. Several transformants were selected for plasmid analysis by restriction enzyme cleavage. One
clone, designated pUR1, was used for further characterisation and cloning. The sequence of the full
urogastrone gene was confirmed by chemical degradation analysis. It should be noted that pUR1 has no
remaining S. aureus DNA sequence present.

The present invention also relates to a bacterial cell, in particular an E. coli cell, characterised in that it
comprises inserted therein a plasmid recombinant as defined above.

The present invention further relates to a process for the production of such a bacterial cell
characterised in that it comprises inserting a plasmid recombinant as defined above into a bacterial cell.

Expression of urogastrone in E. coli: The urogastrone gene insert was cleaved from pUR1 by Hind III
and Bam HI cleavage and purified by polyacrylamide gel electrophoresis and electroelution as above. This
fragment was ligated to Hind III, Bam HI-cleaved pWT121 and pWT221 (see, for example, Tacon, W.C.A., et
al, (1980), Molec. Gen. Genet., 177, 427) and the recombinant molecules used to transform E. coli MRC 8
(see, for example, Emtage, J. S., et al, (1980), Nature, 283, 171-174), with selection for 100 µg/ml ampicillin.
Transformants containing full length urogastrone genes were characterised by restriction enzyme cleavage
analysis and DNA was purified by isopycnic centrifugation in caesium chloride.

Expression of urogastrone-like fusion protein was induced by growth of cells in L-broth (luria broth:
1% (w/v) bacto tryptone, 0.5% (w/v) bacto yeast extract, 0.5% (w/v) NaCl, 0.2% (w/v) glucose, 0.004% (w/v)
thymine, pH 7) containing 100 µg/ml ampicillin to an A600 nm of 0.3. Following centrifugation, the cells
were washed and resuspended in M9 medium lacking tryptophan, but containing 20 µg/ml 3β-indole acrylic
acid. The cells were incubated at 37°C for 4 hours. Under these conditions maximal tryptophan promoter
activity is known to occur (see Tacon, et al, loc cit), and hence expression of the urogastrone fusion protein.

The present invention also relates to a process for the production of urogastrone or an equivalent
thereof or a sub-unit thereof characterised in that it comprises culturing a bacterial cell as defined above
and recovering expressed protein.

The present invention further relates to a fused polypeptide characterised in that it comprises
urogastrone or an equivalent thereof or a sub-unit thereof covalently bonded with all or part of a gene of
the tryptophan operon, e.g. trp E, and in that it is obtained by culturing a bacterial cell as defined above and
recovering expressed protein.

As mentioned above, urogastrone has an application in the treatment of ulcers, and also in other
instances where the growth promoting activity thereof would be beneficial, for example, in wound healing.
Conventional administration forms may be used, the active material being used in an effective amount, for
example from 0.1 to 1.0 µg/kg body weight, preferably from 0.125 to 0.5 µg/kg body weight, more
preferably about 0.25 µg/kg body weight, optimally together with a conventional
pharmaceutically-acceptable carrier, diluent or adjuvant.



Claims

1. A synthetic gene characterised in that it codes for the expression of urogastrone or an equivalent
thereof or a sub-unit thereof in a bacterial cell and comprises the following sequence or a sub-unit thereof:

Coding:     5' AATTCCGATAGCGAGTGTCCTCT-
Non-coding: 3' TTAAGGCTATCGCTCACAGGAGA-
               GAGTCACGATGGTTACTGTCTACATGACGG-
               CTCAGTGCTACCAATGACAGATGTACTGCC-
               CGTCTGTATGTATATTGAGGCTCTAGACAA-
               GCAGACATACATATAACTCCGAGATCTGTT-
               GTACGCGTGTAATTGCGTTGTTGGCTACAT-
               CATGCGCACATTAACGCAACAACCGATGTA-
               CGGTGAGCGCTGTCAGTATCGAGATCTGAA-
               GCCACTCGCGACAGTCATAGCTCTAGACTT-
               ATGGTGGGAACTTAGA
               TACCACCCTTGAATCT

2. A synthetic gene as claimed in Claim 1 characterised in that it comprises the following sequence or a
sub-unit thereof:

Coding:     5' AGCTTAAAAAGAATTCCGATAG-
Non-coding: 3'     ATTTTTCTTAAGGCTATC-
               CGAGTGTCCTCTGAGTCACGATGGTTACT-
               GCTCACAGGAGACTCAGTGCTACCAATGA-
               GTCTACATGACGGCGTCTGTATGTATATT-
               CAGATGTACTGCCGCAGACATACATATAA-
               GAGGCTCTAGACAAGTACGCGTGTAATTG-
               CTCCGAGATCTGTTCATGCGCACATTAAC-
               CGTTGTTGGCTACATCGGTGAGCGCTGTC-
               GCAACAACCGATGTAGCCACTCGCGACAG-
               AGTATCGAGATCTGAAATGGTGGGAACTT-
               TCATAGCTCTAGACTTTACCACCCTTGAA-
               AGATAAG
               TCTATTCCTAG

3. A process for the production of a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim
2 characterised in that it comprises the assembly and ligation of a number of oligonucleotide blocks.

4. A plasmid recombinant characterised in that it comprises a plasmid vector having inserted therein at
an appropriate insertion site a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2, the
plasmid recombinant enabling translation in the correct phase for the mRNA corresponding to the inserted
gene or sub-unit thereof and having a bacterial promoter upstream of and adjacent to the insertion site
such that the inserted gene or sub-unit thereof is under bacterial promoter control.

5. A plasmid recombinant as claimed in Claim 4 characterised in that it comprises as the plasmid vector
pLF1.

6. A process for the production of a plasmid recombinant as claimed in Claim 4 characterised in that it
comprises inserting a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2 into an
appropriate insertion site of an appropriate plasmid vector.

7. A bacterial cell characterised in that it comprises inserted therein a plasmid recombinant as claimed
in Claim 4 or Claim 5.

8. A process for the production of a bacterial cell as claimed in Claim 7 characterised in that it
comprises inserting a plasmid recombinant as claimed in Claim 4 or Claim 5 into a bacterial cell.

9. A process for the production of urogastrone or an equivalent thereof or a sub-unit thereof
characterised in that it comprises culturing a bacterial cell as claimed in Claim 7 and recovering expressed
protein.

10. A fused polypeptide characterised in that it comprises urogastrone or an equivalent thereof or a
sub-unit thereof covalently bonded with all or part of a gene of the tryptophan operon and in that it is
obtained by culturing a bacterial cell as claimed in Claim 7 and recovering expressed protein.

11. A fused polypeptide as claimed in Claim 10 wherein the gene of the tryptophan operon is the trp E 
gene. 

Patent Claims

1. A synthetic gene, characterised in that it codes for the expression of urogastrone or an equivalent
thereof or a sub-unit thereof in a bacterial cell and comprises the following sequence or a sub-unit thereof:

Coding:     5' AATTCCGATAGCGAGTGTCCTCT-
Non-coding: 3' TTAAGGCTATCGCTCACAGGAGA-
               GAGTCACGATGGTTACTGTCTACATGACGG-
               CTCAGTGCTACCAATGACAGATGTACTGCC-
               CGTCTGTATGTATATTGAGGCTCTAGACAA-
               GCAGACATACATATAACTCCGAGATCTGTT-
               GTACGCGTGTAATTGCGTTGTTGGCTACAT-
               CATGCGCACATTAACGCAACAACCGATGTA-
               CGGTGAGCGCTGTCAGTATCGAGATCTGAA-
               GCCACTCGCGACAGTCATAGCTCTAGACTT-
               ATGGTGGGAACTTAGA
               TACCACCCTTGAATCT

2. A synthetic gene according to Claim 1, characterised in that it comprises the following sequence or a
sub-unit thereof:

Coding:     5' AGCTTAAAAAGAATTCCGATAG-
Non-coding: 3'     ATTTTTCTTAAGGCTATC-
               CGAGTGTCCTCTGAGTCACGATGGTTACT-
               GCTCACAGGAGACTCAGTGCTACCAATGA-
               GTCTACATGACGGCGTCTGTATGTATATT-
               CAGATGTACTGCCGCAGACATACATATAA-
               GAGGCTCTAGACAAGTACGCGTGTAATTG-
               CTCCGAGATCTGTTCATGCGCACATTAAC-
               CGTTGTTGGCTACATCGGTGAGCGCTGTC-
               GCAACAACCGATGTAGCCACTCGCGACAG-
               AGTATCGAGATCTGAAATGGTGGGAACTT-
               TCATAGCTCTAGACTTTACCACCCTTGAA-
               AGATAAG
               TCTATTCCTAG

3. A process for the production of a synthetic gene or a sub-unit thereof according to Claim 1 or
Claim 2, characterised in that it comprises the assembly and ligation of a number of oligonucleotide
blocks.

4. A recombinant plasmid, characterised in that it comprises a plasmid vector which has a synthetic
gene or a sub-unit thereof according to Claim 1 or Claim 2 inserted at an appropriate insertion site, the
recombinant plasmid enabling translation in the correct phase for the mRNA corresponding to the inserted
gene or inserted sub-unit thereof and having a bacterial promoter upstream of and adjacent to the
insertion site such that the inserted gene or sub-unit thereof is under the control of the bacterial
promoter.

5. A recombinant plasmid according to Claim 4, characterised in that it contains pLF1 as the plasmid
vector.

6. A process for the production of a recombinant plasmid according to Claim 4, characterised in that it
comprises inserting a synthetic gene or a sub-unit thereof according to Claim 1 or Claim 2 at an
appropriate insertion site of an appropriate plasmid vector.

7. A bacterial cell, characterised in that it contains inserted therein a recombinant plasmid according
to Claim 4 or Claim 5.

8. A process for the production of a bacterial cell according to Claim 7, characterised in that it
comprises inserting a recombinant plasmid according to Claim 4 or Claim 5 into a bacterial cell.

9. A process for the production of urogastrone or an equivalent thereof or a sub-unit thereof,
characterised in that a bacterial cell according to Claim 7 is cultured and the expressed protein is
isolated.

10. A fused polypeptide, characterised in that it comprises urogastrone or an equivalent thereof or a
sub-unit thereof covalently bonded with all or part of the gene of the tryptophan operon and in that it is
obtained by culturing a bacterial cell according to Claim 7 and isolating the expressed protein.

11. A fused polypeptide according to Claim 10, characterised in that the gene of the tryptophan operon
is the trp E gene.



Claims (translated from the French text)

1. A synthetic gene, characterised in that it codes for the expression of urogastrone or an equivalent
thereof or a sub-unit thereof in a bacterial cell and comprises the following sequence or a sub-unit
thereof:

Coding: 5' AATTCCGATAGCGAGTGTCCTCT-
Non-coding: 3' TTAAGGCTATCGCTCACAGGAGA-
GAGTCACGATGGTTACTGTCTACATGACGG-
CTCAGTGCTACCAATGACAGATGTACTGCC-
CGTCTGTATGTATATTGAGGCTCTAGACAA-
GCAGACATACATATAACTCCGAGATCTGTT-
GTACGCGTGTAATTGCGTTGTTGGCTACAT-
CATGCGCACATTAACGCAACAACCGATGTA-
CGGTGAGCGCTGTCAGTATCGAGATCTGAA-
GCCACTCGCGACAGTCATAGCTCTAGACTT-
ATGGTGGGAACTTAGA
TACCACCCTTGAATCT

2. A synthetic gene as claimed in Claim 1, characterised in that it comprises the following sequence
or a sub-unit thereof:

Coding: 5' AGCTTAAAAAGAATTCCGATAG-
Non-coding: 3' ATTTTTCTTAAGGCTATC-
CGAGTGTCCTCTGAGTCACGATGGTTACT-
GCTCACAGGAGACTCAGTGCTACCAATGA-
GTCTACATGACGGCGTCTGTATGTATATT-
CAGATGTACTGCCGCAGACATACATATAA-
GAGGCTCTAGACAAGTACGCGTGTAATTG-
CTCCGAGATCTGTTCATGCGCACATTAAC-
CGTTGTTGGCTACATCGGTGAGCGCTGTC-
GCAACAACCGATGTAGCCACTCGCGACAG-
AGTATCGAGATCTGAAATGGTGGGAACTT-
TCATAGCTCTAGACTTTACCACCCTTGAA-
AGATAAG
TCTATTCCTAG



3. A process for the production of a synthetic gene or a sub-unit thereof as claimed in Claim 1 or
Claim 2, characterised in that it comprises the assembly and ligation of a number of oligonucleotide
fragments.

4. A recombinant plasmid, characterised in that it comprises a plasmid vector having inserted therein,
at an appropriate insertion site, a synthetic gene or a sub-unit thereof as claimed in Claim 1 or
Claim 2, the recombinant plasmid enabling translation in the correct phase for the mRNA corresponding to
the inserted gene or sub-unit thereof and having a bacterial promoter upstream of and adjacent to the
insertion site, such that the inserted gene or sub-unit thereof is under the control of the bacterial
promoter.

5. A recombinant plasmid as claimed in Claim 4, characterised in that it comprises pLF1 as the plasmid
vector.

6. A process for the production of a recombinant plasmid as claimed in Claim 4, characterised in that
it comprises inserting a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2 into an
appropriate insertion site of an appropriate plasmid vector.

7. A bacterial cell, characterised in that it has inserted therein a recombinant plasmid as claimed in
Claim 4 or Claim 5.

8. A process for the production of a bacterial cell as claimed in Claim 7, characterised in that it
comprises inserting a recombinant plasmid as claimed in Claim 4 or Claim 5 into a bacterial cell.

9. A process for the production of urogastrone or an equivalent thereof or a sub-unit thereof,
characterised in that it comprises culturing a bacterial cell as claimed in Claim 7 and recovering the
expressed protein.

10. A fused polypeptide, characterised in that it comprises urogastrone or an equivalent thereof or a
sub-unit thereof bonded to all or part of a gene of the tryptophan operon, and in that it is obtained by
culturing a bacterial cell as claimed in Claim 7 and recovering the expressed protein.
11. A fused polypeptide as claimed in Claim 10, characterised in that the gene of the tryptophan
operon is the trp E gene.

Fig. 1 (Parts 1 and 2): amino acid sequence with the coding and non-coding strands

                -10                 -5
Met Gln Thr Gln Lys Pro Thr Pro Ser Ser Lys
ATG CAA ACA CAA AAA CCG ACT CCA AGC TCC AAG   Coding
                                        TTC   Non-coding

            1               5
Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu
CTT AAA AAG AAT TCC GAT AGC GAG TGT CCT CTG
GAA TTT TTC TTA AGG CTA TCG CTC ACA GGA GAC

    10                  15
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val
AGT CAC GAT GGT TAC TGT CTA CAT GAC GGC GTC
TCA GTG CTA CCA ATG ACA GAT GTA CTG CCG CAG

20                  25                  30
Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala
TGT ATG TAT ATT GAG GCT CTA GAC AAG TAC GCG
ACA TAC ATA TAA CTC CGA GAT CTG TTC ATG CGC

                35                  40
Cys Asn Cys Val Val Gly Tyr Ile Gly Glu Arg
TGT AAT TGC GTT GTT GGC TAC ATC GGT GAG CGC
ACA TTA ACG CAA CAA CCG ATG TAG CCA CTC GCG

            45                  50
Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu Leu
TGT CAG TAT CGA GAT CTG AAA TGG TGG GAA CTT
ACA GTC ATA GCT CTA GAC TTT ACC ACC CTT GAA

        55
Arg Ter Gly Ser
AGA TAA GGA TCC   Coding
TCT ATT CCT AGG   Non-coding

Fig. 2 (Parts 1 and 2): the sequence of Fig. 1 annotated with restriction enzyme recognition sites.
The sites marked include Hind III, Alu I, EcoRI, Mnl I, Hinf I, Acc I, Hga I, Xba I, FnuDII, Hha I,
Hph I, Hae II, Mbo I, Taq I and Bgl II.

Description

This invention relates to a synthetic urogastrone gene, to corresponding plasmid recombinants and 
transformed cells, to the production thereof and to urogastrone expression. 

Urogastrone is a polypeptide hormone (protein) synthesised in the duodenum and in the salivary
glands of normal humans (see, for example, Heitz et al. (1978), Gut, 19, 408-413). Urogastrone
suppresses the secretion of gastric acid and promotes cell growth (see, for example, Elder et al. (1975),
Gut, 16, 887-893). Therefore, it has an application in the treatment of ulcers and in the promotion of
wound healing. Urogastrone is excreted in small amounts in human urine and may be isolated therefrom.
There exists, however, a need for a more viable commercial production thereof and such is provided
according to the present invention.
Urogastrone is known to consist of 53 amino acids in the following sequence:

Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp-
Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr-
Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys-
Val Val Gly Tyr Ile Gly Glu Arg Cys Gln Tyr-
Arg Asp Leu Lys Trp Trp Glu Leu Arg

(see, for example, Gregory H. and Preston B. M. (1977), Int. J. Peptide Protein Res., 9, 107-118).

From the above amino acid sequence, a corresponding synthetic gene sequence has been invented,
subject to a number of specific non-obvious criteria, and oligonucleotide blocks synthesised which, when
assembled, form a synthetic gene coding for urogastrone. The blocks have been hybridised and ligated in
pre-determined stages to construct the urogastrone gene in two portions. These have been cloned in two
operations into a new specifically-designed chimeric E. coli/S. aureus vector so as to produce a full length
urogastrone gene flanked only by E. coli plasmid DNA. The gene has been excised from this recombinant
and re-cloned into vectors specifically designed to maximise expression of the gene in E. coli, under the
control of the promoter obtained from the E. coli tryptophan operon. A protein resembling human
urogastrone has thus been expressed in E. coli.

From the above amino acid sequence, because of the degeneracy of the genetic code, it is possible to 
predict numerous nucleotide sequences which would code for the protein. 
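
The scale of this degeneracy can be estimated directly. The following Python sketch assumes the standard genetic code and simply multiplies together the number of synonymous codons available for each residue. The one-letter rendering of the sequence and the helper name `count_encodings` are introduced here for illustration only and are not part of the original disclosure.

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code
# (assumption: the standard code applies, as it does in E. coli).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}

# Urogastrone sequence from the description, written in one-letter code,
# one string per line of the three-letter listing.
UROGASTRONE = (
    "NSDSECPLSHD"
    "GYCLHDGVCMY"
    "IEALDKYACNC"
    "VVGYIGERCQY"
    "RDLKWWELR"
)


def count_encodings(protein: str) -> int:
    """Return how many distinct nucleotide sequences encode `protein`."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in protein)


if __name__ == "__main__":
    print(len(UROGASTRONE), "residues")
    print(f"{count_encodings(UROGASTRONE):.3e} possible coding sequences")
```

The count ignores stop codons and any flanking sequence, so it is a lower bound on the design space rather than an exact figure for the gene actually built.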

In the inventive determination of an optimum sequence from the large number of possibilities, several
non-obvious criteria have been observed. Firstly, trinucleotide codons should be used which are
acceptable or preferable in the cells to be used, in particular E. coli. Secondly, it was decided that it was
desirable to have different restriction enzyme recognition sites at the termini of the molecule so as to
allow insertion into a plasmid in a desired orientation. Moreover, it was decided to select sites which
allowed the use of well-understood cloning vectors, such as pBR322 (see, for example, Bolivar F. et al.
(1977), Gene, 2, 95-113). In fact, Hind III and Bam HI sites were selected and introduced at the 5' and 3'
ends, respectively. Thirdly, it was thought desirable to introduce a series of restriction endonuclease
recognition sites strategically placed along the molecule to enable the gene to be specifically dissected to
aid characterisation and, possibly, mutagenesis. Also, this measure allowed the two portions of the
molecule to be cloned in stages. In particular, an Xba I site was introduced at a central location in the
gene. Fourthly, the synthesis should not be unnecessarily complicated and illegitimate
cross-hybridisations should be minimised in order to facilitate gene assembly. Using a computer, energies of
interactions arising from all possible approximations were calculated (see, for example, Tinoco Jr. I. et al.
(1971), Nature, 230, 362-367; Powers G. J. et al. (1975), JACS, 97, 875-899), so that stable off-diagonal
interactions might be avoided where possible. Fifthly, since the protein ultimately expressed in bacterial
cells will be in the form of a fusion product, it was desirable to have a means of cleaving the urogastrone
portion from such a fusion product. Since urogastrone is known to be insensitive to trypsin (see, for
example, Gregory and Preston, loc. cit.), the codons specifying the dipeptide lys-lys were introduced near
the end of the gene corresponding to the urogastrone N-terminus in order to serve as a substrate for tryptic
proteolysis.
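
Several of these criteria can be checked mechanically against the expanded coding strand given later in the description (and in Figure 1). The Python sketch below is an illustration only: the recognition sequences for EcoRI, Xba I and Bgl II are the standard ones, and the names `CODING`, `SITES` and `find_sites` are hypothetical helpers introduced here, not part of the original disclosure.

```python
import re

# Expanded coding strand, 5' to 3', transcribed from the description.
# It begins with the Hind III cohesive end AGCTT.
CODING = (
    "AG"
    "CTTAAAAAGAATTCCGATAGCGAGTGTCCTCTG"
    "AGTCACGATGGTTACTGTCTACATGACGGCGTC"
    "TGTATGTATATTGAGGCTCTAGACAAGTACGCG"
    "TGTAATTGCGTTGTTGGCTACATCGGTGAGCGC"
    "TGTCAGTATCGAGATCTGAAATGGTGGGAACTT"
    "AGATAAG"
)

# Standard recognition sequences for three of the sites named in Figure 2.
SITES = {"EcoRI": "GAATTC", "XbaI": "TCTAGA", "BglII": "AGATCT"}


def find_sites(seq: str, site: str) -> list[int]:
    """Return 0-based start positions of every (possibly overlapping) match."""
    return [m.start() for m in re.finditer(f"(?={site})", seq)]


if __name__ == "__main__":
    for name, site in SITES.items():
        print(name, find_sites(CODING, site))
    # Hind III cohesive end at the 5' terminus.
    print("Hind III end:", CODING.startswith("AGCTT"))
    # Lys-Lys codons (AAA AAG) immediately precede the Asn codon (AAT).
    print("Lys-Lys before Asn:", CODING[5:14] == "AAAAAGAAT")
```

Under these assumptions the single Xba I site falls close to the midpoint of the 174-nucleotide strand, which is consistent with the "central location" described above.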

The present invention relates to a synthetic gene characterised in that it codes for the expression of
urogastrone or an equivalent thereof or a sub-unit thereof in a bacterial cell and comprises the following
sequence or sub-unit thereof:

5' AATTCCGATAGCGAGTGTCCTCTG-
3' TTAAGGCTATCGCTCACAGGAGAC-
AGTCACGATGGTTACTGTCTACATGACGG-
TCAGTGCTACCAATGACAGATGTACTGCC-
CGTCTGTATGTATATTGAGGCTCTAGACA-
GCAGACATACATATAACTCCGAGATCTGT-
AGTACGCGTGTAATTGCGTTGTTGGCTAC-
TCATGCGCACATTAACGCAACAACCGATG-
ATCGGTGAGCGCTGTCAGTATCGAGATCT-
TAGCCACTCGCGACAGTCATAGCTCTAGA-
GAAATGGTGGGAACTTAGA
CTTTACCACCCTTGAATCT
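
As a consistency check on the listing above, the two strands should be exact base-by-base complements and the coding strand should contain 53 codons. A minimal Python sketch follows; the strand strings are transcribed from the listing, and `is_complementary` is a hypothetical helper name introduced here.

```python
# Base-pairing rule used for the check.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Coding strand (5' to 3'), one string per line of the listing.
CODING = "".join([
    "AATTCCGATAGCGAGTGTCCTCTG",
    "AGTCACGATGGTTACTGTCTACATGACGG",
    "CGTCTGTATGTATATTGAGGCTCTAGACA",
    "AGTACGCGTGTAATTGCGTTGTTGGCTAC",
    "ATCGGTGAGCGCTGTCAGTATCGAGATCT",
    "GAAATGGTGGGAACTTAGA",
])

# Non-coding strand (3' to 5'), written in the same left-to-right order.
NON_CODING = "".join([
    "TTAAGGCTATCGCTCACAGGAGAC",
    "TCAGTGCTACCAATGACAGATGTACTGCC",
    "GCAGACATACATATAACTCCGAGATCTGT",
    "TCATGCGCACATTAACGCAACAACCGATG",
    "TAGCCACTCGCGACAGTCATAGCTCTAGA",
    "CTTTACCACCCTTGAATCT",
])


def is_complementary(top: str, bottom: str) -> bool:
    """True if `bottom` pairs base-for-base with `top` (antiparallel strands
    written in the same left-to-right order, as in the listing)."""
    return len(top) == len(bottom) and top.translate(COMPLEMENT) == bottom


if __name__ == "__main__":
    print("length:", len(CODING), "nt =", len(CODING) // 3, "codons")
    print("strands complementary:", is_complementary(CODING, NON_CODING))
```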



In fact, bearing in mind, inter alia, the above-mentioned considerations, for convenience, a slightly 
longer sequence was selected which is as follows : 

Coding 5' AGCTTAAAAAGAATTCCGATAGCGAGT-
Non-coding 3' ATTTTTCTTAAGGCTATCGCTCA-
GTCCTCTGAGTCACGATGGTTACTGTCT-
CAGGAGACTCAGTGCTACCAATGACAGA-
ACATGACGGCGTCTGTATGTATATTGAG-
TGTACTGCCGCAGACATACATATAACTC-
GCTCTAGACAAGTACGCGTGTAATTGCG-
CGAGATCTGTTCATGCGCACATTAACGC-
TTGTTGGCTACATCGGTGAGCGCTGTCA-
AACAACCGATGTAGCCACTCGCGACAGT-
GTATCGAGATCTGAAATGGTGGGAACTT-
CATAGCTCTAGACTTTACCACCCTTGAA-
AGATAAG
TCTATTCCTAG

The whole of this expanded sequence is shown in Figure 1 of the accompanying drawings and the
restriction enzyme recognition sites are shown in Figure 2 of the accompanying drawings.
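
The reading frame of the expanded sequence can be confirmed by translating the coding strand. The Python sketch below assumes the standard genetic code and that the frame starts at the third nucleotide (immediately after the leading AG of the Hind III end), as Figure 1 indicates. The names `CODON_TABLE` and `translate` are illustrative helpers, not part of the original disclosure.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Expanded coding strand, 5' to 3', transcribed from the description.
CODING = (
    "AG"
    "CTTAAAAAGAATTCCGATAGCGAGTGTCCTCTG"
    "AGTCACGATGGTTACTGTCTACATGACGGCGTC"
    "TGTATGTATATTGAGGCTCTAGACAAGTACGCG"
    "TGTAATTGCGTTGTTGGCTACATCGGTGAGCGC"
    "TGTCAGTATCGAGATCTGAAATGGTGGGAACTT"
    "AGATAAG"
)

# Urogastrone in one-letter code, from the amino acid listing.
UROGASTRONE = "NSDSECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELR"


def translate(dna: str) -> str:
    """Translate complete codons from the start of `dna` up to the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    product_seq = translate(CODING[2:])
    print(product_seq)
    print("Leu-Lys-Lys + urogastrone:", product_seq == "LKK" + UROGASTRONE)
```

Read this way, the strand encodes Leu-Lys-Lys followed by the 53 residues of urogastrone and a TAA stop codon, matching the layout of Figure 1.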

The present invention also relates to a process for the production of the above said synthetic genes or
sub-units thereof characterised in that it comprises the assembly and ligation of a number of
oligonucleotide blocks.

Such « blocks » should not be regarded as sub-units of the gene. In the present context « sub-unit » 
refers to a sequence which is less than the whole, but which exhibits the desired properties. 

It was in fact decided to synthesise a molecule having the above expanded sequence by making 23
synthetic oligonucleotide blocks as illustrated in Figure 3 of the accompanying drawings, which will
assemble by single-strand overlaps to give the complete double-stranded nucleotide sequence.

In order to minimise strong illegitimate interactions near the centre of the molecule during assembly, 
certain blocks of modified sequence have also been synthesised, as illustrated in Figure 4 of the 
accompanying drawings. 

As mentioned above, the synthetic blocks selected are shown in Figure 3 of the accompanying
drawings. The blocks may be constructed using known synthesis techniques (see, for example, Agarwal
et al. (1970), Nature, 227, 27-34; and Crea et al. (1978), Proc. Natl. Acad. Sci. U.S.A., 75, 5765-5769).

The synthetic methods will now be illustrated with reference to the synthesis of the
tetradecanucleotide ApGpTpTpCpCpCpApCpCpApTpTpT.

The methods of building up oligonucleotides from smaller units by successive coupling reactions are
well known (see, for example, Hsiung et al. (1979), Nucleic Acid Research, 6, 1371-1385). The completely
protected tetradecanucleotide was built up as shown in Figure 5 of the accompanying drawings (wherein,
for convenience, protecting groups are not shown).

The condensation reactions indicated by arrows in Figure 5 were carried out by the following
procedure exemplified in the synthesis of:

DMTr A(bz) p G(ibu) p-CNEt
           |        |
          φCl      φCl

as shown in Figure 6 of the accompanying drawings. From 1.1 to 1.5 mmole of the 3'-phosphodiester
component (I) was condensed with 1.0 mmole of the 5'-hydroxyl component (II) in anhydrous pyridine in
the presence of from 3 to 4.5 mmoles of 2,4,6-triisopropyl-benzene-sulphonyl-tetrazolide. The reaction
was left for 1 hour at room temperature or until chromatography on silica TLC plates eluted with 10 % (v/v)
methanol in chloroform showed that the hydroxyl component was exhausted. The reaction was quenched
with 5 % (w/v) sodium bicarbonate solution and extracted with chloroform. The chloroform extract was
dried and loaded onto a reverse phase chromatography column (ODS bonded to 15-25 micron silica). The
fully protected dinucleotide product (III) was eluted with a solvent gradient from chloroform : methanol :
water (2:6:3 v/v) to chloroform : methanol : water (2:6:0.5 v/v). The product (III) was extracted into
chloroform and dried. The final isolated yield was 81 %.

EP 0 046 039 B2

To proceed to further condensations, the terminal protecting group (DMTr or CNEt) was removed
selectively using triethylamine in pyridine (CNEt) or a 2 % (w/v) solution of benzene sulphonic acid in
chloroform : methanol (DMTr) as shown in Figure 7 of the accompanying drawings.

At the completion of the synthesis, all of the protecting groups were removed by sequential treatment
with 0.1 M tetraethylammonium fluoride in THF/pyridine/water (8:1:1 v/v), ammonia and 80 % acetic
acid. The deprotected oligonucleotides were purified by ion-exchange HPLC and sequence analysis was
carried out by the method of Wu et al. (1976), Anal. Biochem., 74, 73-93.

The oligomeric blocks of nucleotides were hybridised and ligated (see, for example, Agarwal et al.,
loc. cit.) in a series of steps, in order to minimise the possibilities for undesirable interactions, leading to
the formation of the two portions as shown in Figure 8 of the accompanying drawings. The order of the
additions in the assembly scheme was optimised for minimal incorrect ligations and, in the case of
especially difficult oligomeric blocks, notably 7 and 8, sub-molar quantities were used in order to remove
all monomeric units before further additions were made.
In more detail:

Left-hand portion: Blocks 1 and 2 were ligated to form a dimer about the Hind III site. Blocks 4, 5 and
6 and 8, 9 and 10 were also ligated in the first round of the assembly scheme. Molar equivalents were used
for all but block 8, where 0.75 molar equivalents were employed.
Block 3 was ligated with 1 + 2, block 7 (0.75 molar equivalent) with 4 to 6 and block 11 with 8 to 10.
The 8 + 9 + 10 + 11 assembly has one flush end, hence some blunt-end dimerisation was observed.

1 to 3 and 4 to 7 were ligated and finally 8 to 11 were ligated to the resulting 1 to 7 species.

The dimeric 1 to 11 left-hand portion was then cleaved by Hind III (EC 3.1.23.21) and Xba I (EC
3.1.23.4) to generate the monomeric left-hand portion, with the correct cohesive termini to allow
construction of recombinant plasmids.

Right-hand portion: Blocks 12 and 13 were ligated to form a dimer about the Xba I site and blocks 20,
23 and 22 similarly ligated to form a dimer about the Bam HI site. Blocks 14, 15 and 17, and 16, 18 and 19
were also ligated at this stage.

The 12, 13 dimer and the 14, 15, 17 assembly were ligated, as were the 16, 18, 19 assembly with the 20,
22, 23 dimer, where block 21 was used as a joining section.

These two species were then ligated to give an oligomeric molecule which was cleaved by Xba I and
Bam HI (EC 3.1.23.6) to give the monomer 12 to 23 species having the correct cohesive termini to allow
construction of recombinant plasmids.
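
The staged assembly described above can be written down as a nested structure and checked for completeness. In the Python sketch below each tuple stands for one ligation product. The nesting is a reading of the text above, and the names `LEFT`, `RIGHT` and `blocks` are hypothetical helpers introduced for illustration.

```python
# Ligation order as nested tuples: each tuple is one ligation product,
# and its members are the species joined in that step.
LEFT = (
    (((1, 2), 3), ((4, 5, 6), 7)),   # 1 to 3 ligated with 4 to 7
    ((8, 9, 10), 11),                # 8 to 11, added last
)
RIGHT = (
    ((12, 13), (14, 15, 17)),            # Xba I dimer with the 14, 15, 17 assembly
    ((16, 18, 19), 21, (20, 22, 23)),    # block 21 used as the joining section
)


def blocks(scheme):
    """Yield every block number contained in a nested assembly scheme."""
    if isinstance(scheme, int):
        yield scheme
    else:
        for part in scheme:
            yield from blocks(part)


if __name__ == "__main__":
    print("left-hand blocks: ", sorted(blocks(LEFT)))
    print("right-hand blocks:", sorted(blocks(RIGHT)))
```

Flattening the two schemes should give blocks 1 to 11 and 12 to 23 respectively, each used exactly once, which accounts for all 23 synthetic blocks.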

The present invention further relates to a plasmid recombinant characterised in that it comprises a
plasmid vector having inserted therein at an appropriate insertion site a synthetic gene or a sub-unit
thereof as defined above, the plasmid recombinant enabling translation in the correct phase for the
mRNA corresponding to the inserted gene or sub-unit thereof and having a bacterial promoter upstream
of and adjacent to the insertion site such that the inserted gene or sub-unit thereof is under bacterial
promoter control.

A particular vector, designated pLF1, has been specifically designed to afford advantages for the
purposes of the present invention. The inventive plasmid pLF1 is a 5K bp plasmid which may be
propagated in E. coli and which may be constructed from pBR322 and pUB110 (see, for example,
Gryczan, T. J. et al. (1978), J. Bacteriol., 134, 318) by inserting the DNA sequence lying between the
EcoRI and Bam HI sites of the S. aureus plasmid pUB110 (comprising approximately 870 bp) between the
EcoRI and Bam HI sites of pBR322, thereby replacing that region of pBR322.

The present invention further relates to a process for the production of such a plasmid recombinant 
characterised in that it comprises inserting such a synthetic gene or a sub-unit thereof as defined above 
into an appropriate insertion site of an appropriate plasmid vector. 
The following illustrates the present invention : 

Phosphorylation of oligonucleotide blocks: In each case, 6 µg of oligomer and 60 µCi of [γ-32P] ATP
(> 5000 Ci/mmol) were dried and redissolved in a final buffer concentration of 50 mM Tris-HCl, pH 7.6, 10
mM MgCl2, 0.25 mM ATP, 10 mM β-mercaptoethanol and incubated with 4 units (1 unit is the amount that
catalyses the production of 1 nmole of acid-insoluble 32P after incubation for 30 minutes at 37 °C
according to Richardson, C.C. (1972), Progress in Nucleic Acids Research, 2, 815) of T4 polynucleotide
kinase (EC 2.7.1.78, Bethesda Research Labs) at 37 °C for 15 minutes. The enzyme was subsequently
inactivated by a 5 minute incubation at 100 °C.

Ligation of oligonucleotide blocks: Except where indicated above, equimolar quantities (6 to 18 µg)
of oligonucleotide blocks were incubated in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 1 mM ATP, 20 mM
dithiothreitol (DTT) with 6 units (1 unit is the amount that catalyses the conversion of 1 nmole of 32PPi
into (α/β 32P)-ATP in 20 minutes at 37 °C according to Weiss B. et al. (1968), J. Biol. Chem., 243, 4543) of T4
DNA ligase (EC 6.5.1.1, Bethesda Research Labs) at 25 °C for from 3 to 16 hours. Ligated DNA was
precipitated by addition of 2.5 vol absolute ethanol, collected by centrifugation and redissolved in water.

Purification of ligated species: Ligated oligonucleotide blocks were electrophoresed on 20 % (w/v)
polyacrylamide in 90 mM Tris-HCl, pH 8.3, 90 mM boric acid, 2.5 mM EDTA (TBE buffer), and the
fragments located by autoradiography. Slices of gel containing the fragments were excised and the DNA
electroeluted at 1 mA in TBE buffer onto 0.5 ml of DEAE cellulose (DE52, Whatman) for a few hours. After
extensive washing by 0.1 M NH4OAc, 2 mM Mg(OAc)2, 0.02 % (w/v) SDS, 0.02 mM EDTA (AGEB buffer), the
DNA was eluted from the DEAE cellulose by 2 ml of 1.1 M NaCl in AGEB buffer and precipitated by
addition of 2.5 vol absolute ethanol.

Construction of chimeric cloning vector pLF1: In order to facilitate the two-stage cloning of
urogastrone, it was desirable to construct a vector or vectors having Hind III, Xba I and Bam HI cleavage
sites, see Figure 9 of the accompanying drawings. However, no readily available E. coli plasmids possess
Xba I sites. It was surprisingly noticed that the S. aureus plasmid pUB110, which cannot be propagated in
E. coli (see, for example, Keggins K. M. et al. (1978), Proc. Natl. Acad. Sci. U.S.A., 75, 1423), contains a
region of about 1 kbp of DNA bounded by sites for EcoRI and Bam HI having an approximately central Xba
I site. Therefore, pUB110 was cleaved with EcoRI (EC 3.1.4.32) and Bam HI and the DNA fragments
electrophoresed on 5 % (w/v) polyacrylamide. The approximately 1 kbp EcoRI/Bam HI fragment was
removed by electroelution from the excised gel slice onto DEAE cellulose, eluted by 1.1 M NaCl and
ethanol precipitated. The E. coli plasmid pAT153 (see, for example, Twigg, A. J., and Sherratt D. (1980),
Nature, 283, 216-218) was also cleaved with EcoRI and Bam HI, and the 3282 bp fragment purified by
electroelution as above. These two purified DNA fragments were ligated in equimolar quantities using 10
units of T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 1 mM ATP, 20 mM DTT at 15 °C for
18 hours. After ethanol precipitation, the ligated DNA was dissolved in 100 µl 10 mM Tris pH 7.5, 10 mM
MgCl2, 10 mM CaCl2 and transformed into E. coli K12 HB101 (genotype gal-, lac-, ara-, pro-, arg-,
str(r), rec A-, rk- mk-; see, for example, Boyer H. W. and Roulland-Dussoix D., J. Mol. Biol., 41, 459-
472) using known methods (see, for example, Cohen et al. (1972), Proc. Natl. Acad. Sci. U.S.A., 69, 2110-
2114) and transformants resistant to 100 µg/ml ampicillin selected. Several transformants were analysed
further by restriction enzyme cleavage of plasmid DNA, and one full length clone, designated pLF1,
selected for further use as a cloning vector. At a later stage, an additional Hind III restriction sequence was
created at the EcoRI site.

Cloning of the synthetic urogastrone gene in pLF1 : The two portions of the assembled urogastrone 
gene were cloned in two transformation stages, as illustrated in Figure 10 of the accompanying drawings. 

Right-hand portion: The longer fragment of Xba I, Bam HI-cleaved pLF1 was purified by
electroelution as above and ligated to a large excess of the right-hand portion assembly of the
urogastrone gene using 10 units T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 1 mM ATP, 20
mM DTT at 15 °C for 18 hours. This was transformed into E. coli as before, with selection for 100 µg/ml
ampicillin. Several transformants were selected for plasmid analysis by restriction enzyme cleavage
analysis and one clone, designated pUR1, used for further cloning. The sequence of the inserted
urogastrone gene DNA was confirmed by chemical degradation analysis (see, for example, Maxam, A.,
and Gilbert, W. (1977), Proc. Natl. Acad. Sci. U.S.A., 74, 560-4).

In order to have a Hind III site for ligation to the 5' end of the left-hand portion of the gene, the EcoRI
site was modified as follows: pUR1 was cleaved with EcoRI and the resulting recessed ends filled using 5
units (1 unit is the amount that incorporates 10 nmoles of total nucleotides into an acid-precipitable
fraction in 30 minutes at 37 °C using poly-d(A-T) as primer according to Richardson, C.C. et al. (1964), J.
Biol. Chem., 239, 222) of DNA polymerase I (EC 2.7.7.7) in 50 mM Tris-HCl, pH 7.5, 10 mM MgCl2, 1 mM
β-mercaptoethanol with 0.25 mM of each deoxynucleotide for 30 minutes at 15 °C.

To the resulting flush ends were ligated a large excess of synthetic Hind III linkers (Collaborative
Research) by the known blunt end ligation procedure (see, for example, Ullrich A. et al. (1977), Science,
196, 1313-1319). The DNA was ligated using 6 units of T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM
MgCl2, 1 mM ATP, 20 mM DTT at 15 °C for 18 hours.

After Hind III restriction cleavage and purification, the full length plasmid was religated and
transformed into E. coli as above, with selection for 100 µg/ml ampicillin. A transformant having a Hind III
site, designated pUR2, was selected for further cloning.

Left-hand portion: pUR2 was cleaved with Hind III and Xba I restriction enzymes and the longer
fragment purified by electroelution as above. This was ligated to a two-fold molar excess of the assembled
left-hand portion of the urogastrone gene using 6 units T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 10 mM
MgCl2, 1 mM ATP, 20 mM DTT for 18 hours at 15 °C. This was transformed into E. coli K12 MRC 8
(genotype dap 103 hsd R met B1 glm 533 upp 1 dap 101 sup E thy A 103 deo rec A1) with selection for
100 µg/ml ampicillin. Several transformants were selected for plasmid analysis by restriction enzyme
cleavage. One clone, designated pUR1, was used for further characterisation and cloning. The sequence
of the full urogastrone gene was confirmed by chemical degradation analysis. It should be noted that
pUR1 has no remaining S. aureus DNA sequence present.

The present invention also relates to a bacterial cell, in particular an E. coli cell, characterised in that 
it comprises inserted therein a plasmid recombinant as defined above. 

The present invention further relates to a process for the production of such a bacterial cell
characterised in that it comprises inserting a plasmid recombinant as defined above into a bacterial cell.

Expression of urogastrone in E. coli: The urogastrone gene insert was cleaved from pUR1 by Hind III
and Bam HI cleavage and purified by polyacrylamide gel electrophoresis and electroelution as above. This
fragment was ligated to Hind III, Bam HI-cleaved pWT121 and pWT221 (see, for example, Tacon, W.C.A. et
al. (1980), Molec. Gen. Genet., 177, 427) and the recombinant molecules used to transform E. coli MRC 8
(see, for example, Emtage, J. S. et al. (1980), Nature, 283, 171-174), with selection for 100 µg/ml ampicillin.
Transformants containing full length urogastrone genes were characterised by restriction enzyme
cleavage analysis and DNA was purified by isopycnic centrifugation in caesium chloride.

Expression of urogastrone-like fusion protein was induced by growth of cells in L-broth (Luria broth:
1 % (w/v) bacto tryptone, 0.5 % (w/v) bacto yeast extract, 0.5 % (w/v) NaCl, 0.2 % (w/v) glucose, 0.004 %
(w/v) thymine, pH 7) containing 100 µg/ml ampicillin to an A600 nm of 0.3. Following centrifugation, the
cells were washed and resuspended in M9 medium lacking tryptophan, but containing 20 µg/ml 3-β-indole
acrylic acid. The cells were incubated at 37 °C for 4 hours. Under these conditions maximal tryptophan
promoter activity is known to occur (see Tacon et al., loc. cit.), and hence expression of the urogastrone
fusion protein.

The present invention also relates to a process for the production of urogastrone or an equivalent 
thereof or a sub-unit thereof characterised in that it comprises culturing a bacterial cell as defined above 
and recovering expressed protein. 

The present invention further relates to a fused polypeptide characterised in that it comprises
urogastrone or an equivalent thereof or a sub-unit thereof covalently bonded with all or part of a gene of
the tryptophan operon, e.g. trp E, and in that it is obtained by culturing a bacterial cell as defined above
and recovering expressed protein.

As mentioned above, urogastrone has an application in the treatment of ulcers, and also in other
instances where the growth promoting activity thereof would be beneficial, for example, in wound
healing. Conventional administration forms may be used, the active material being used in an effective
amount, for example from 0.1 to 1.0 µg/kg body weight, preferably from 0.125 to 0.5 µg/kg body weight,
more preferably about 0.25 µg/kg body weight, optimally together with a conventional
pharmaceutically-acceptable carrier, diluent or adjuvant.

Claims 

1. A synthetic gene characterised in that it codes for the expression of urogastrone or an equivalent
thereof or a sub-unit thereof in a bacterial cell and comprises the following sequence or a sub-unit
thereof:

Coding: 5' AATTCCGATAGCGAGTGTCCTCT-
Non-coding: 3' TTAAGGCTATCGCTCACAGGAGA-
GAGTCACGATGGTTACTGTCTACATGACGG-
CTCAGTGCTACCAATGACAGATGTACTGCC-
CGTCTGTATGTATATTGAGGCTCTAGACAA-
GCAGACATACATATAACTCCGAGATCTGTT-
GTACGCGTGTAATTGCGTTGTTGGCTACAT-
CATGCGCACATTAACGCAACAACCGATGTA-
CGGTGAGCGCTGTCAGTATCGAGATCTGAA-
GCCACTCGCGACAGTCATAGCTCTAGACTT-
ATGGTGGGAACTTAGA
TACCACCCTTGAATCT

2. A synthetic gene as claimed in Claim 1 characterised in that it comprises the following sequence or 
a sub-unit thereof : 

Coding: 5' AGCTTAAAAAGAATTCCGATAG-
Non-coding: 3' ATTTTTCTTAAGGCTATC-
CGAGTGTCCTCTGAGTCACGATGGTTACT-
GCTCACAGGAGACTCAGTGCTACCAATGA-
GTCTACATGACGGCGTCTGTATGTATATT-
CAGATGTACTGCCGCAGACATACATATAA-
GAGGCTCTAGACAAGTACGCGTGTAATTG-
CTCCGAGATCTGTTCATGCGCACATTAAC-
CGTTGTTGGCTACATCGGTGAGCGCTGTC-
GCAACAACCGATGTAGCCACTCGCGACAG-
AGTATCGAGATCTGAAATGGTGGGAACTT-
TCATAGCTCTAGACTTTACCACCCTTGAA-
AGATAAG
TCTATTCCTAG

3. A process for the production of a synthetic gene or a sub-unit thereof as claimed in Claim 1 or
Claim 2 characterised in that it comprises the assembly and ligation of a number of oligonucleotide blocks.

4. A plasmid recombinant characterised in that it comprises a plasmid vector having inserted therein
at an appropriate insertion site a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2, the
plasmid recombinant enabling translation in the correct phase for the mRNA corresponding to the
inserted gene or sub-unit thereof and having a bacterial promoter upstream of and adjacent to the
insertion site such that the inserted gene or sub-unit thereof is under bacterial promoter control.

5. A plasmid recombinant as claimed in Claim 4 characterised in that it comprises as the plasmid
vector pLF1.

6. A process for the production of a plasmid recombinant as claimed in Claim 4 characterised in that 
it comprises inserting a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2 into an 
appropriate insertion site of an appropriate plasmid vector. 
7. A bacterial cell characterised in that it comprises inserted therein a plasmid recombinant as
claimed in Claim 4 or Claim 5.

8. A process for the production of a bacterial cell as claimed in Claim 7 characterised in that it
comprises inserting a plasmid recombinant as claimed in Claim 4 or Claim 5 into a bacterial cell.

9. A process for the production of urogastrone or an equivalent thereof or a sub-unit thereof
characterised in that it comprises culturing a bacterial cell as claimed in Claim 7 and recovering
expressed protein.

10. A fused polypeptide characterised in that it comprises urogastrone or an equivalent thereof or a
sub-unit thereof covalently bonded with all or part of a gene of the tryptophan operon and in that it is
obtained by culturing a bacterial cell as claimed in Claim 7 and recovering expressed protein.

11. A fused polypeptide as claimed in Claim 10 wherein the gene of the tryptophan operon is the trp E
gene.
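
The duplex sequences recited in Claims 1 and 2 lend themselves to a mechanical consistency check. The following Python sketch is offered only as an illustration and forms no part of the claimed subject matter; it assumes the strands exactly as transcribed in the claims, and the names `complement`, `overhangs` and `offset` are hypothetical helpers introduced here. It tests that each non-coding strand is the base-by-base complement of its coding strand and reports the unpaired ends of the Claim 2 duplex.

```python
# Strands as recited in Claims 1 and 2: coding strands are written 5'->3',
# non-coding strands 3'->5', in the same order as they appear in the claims.
CLAIM1_CODING = (
    "AATTCCGATAGCGAGTGTCCTCT"
    "GAGTCACGATGGTTACTGTCTACATGACGG"
    "CGTCTGTATGTATATTGAGGCTCTAGACAA"
    "GTACGCGTGTAATTGCGTTGTTGGCTACAT"
    "CGGTGAGCGCTGTCAGTATCGAGATCTGAA"
    "ATGGTGGGAACTTAGA"
)
CLAIM1_NONCODING = (
    "TTAAGGCTATCGCTCACAGGAGA"
    "CTCAGTGCTACCAATGACAGATGTACTGCC"
    "GCAGACATACATATAACTCCGAGATCTGTT"
    "CATGCGCACATTAACGCAACAACCGATGTA"
    "GCCACTCGCGACAGTCATAGCTCTAGACTT"
    "TACCACCCTTGAATCT"
)
CLAIM2_CODING = (
    "AGCTTAAAAAGAATTCCGATAG"
    "CGAGTGTCCTCTGAGTCACGATGGTTACT"
    "GTCTACATGACGGCGTCTGTATGTATATT"
    "GAGGCTCTAGACAAGTACGCGTGTAATTG"
    "CGTTGTTGGCTACATCGGTGAGCGCTGTC"
    "AGTATCGAGATCTGAAATGGTGGGAACTT"
    "AGATAAG"
)
CLAIM2_NONCODING = (
    "ATTTTTCTTAAGGCTATC"
    "GCTCACAGGAGACTCAGTGCTACCAATGA"
    "CAGATGTACTGCCGCAGACATACATATAA"
    "CTCCGAGATCTGTTCATGCGCACATTAAC"
    "GCAACAACCGATGTAGCCACTCGCGACAG"
    "TCATAGCTCTAGACTTTACCACCCTTGAA"
    "TCTATTCCTAG"
)

# Watson-Crick pairing (A-T, C-G).
_PAIRS = str.maketrans("ACGT", "TGCA")


def complement(strand):
    """Return the base-by-base complement without reversing the strand."""
    return strand.translate(_PAIRS)


def overhangs(coding, noncoding, offset):
    """Return the two unpaired ends of a duplex, each written 5'->3'.

    `offset` (an illustrative parameter) is the number of coding-strand
    bases that precede the first base pair.
    """
    paired = len(coding) - offset
    if complement(coding[offset:]) != noncoding[:paired]:
        raise ValueError("strands are not complementary over the paired region")
    left = coding[:offset]              # 5' extension of the coding strand
    right = noncoding[paired:][::-1]    # 5' extension of the non-coding strand
    return left, right


claim1_fully_paired = complement(CLAIM1_CODING) == CLAIM1_NONCODING
claim2_ends = overhangs(CLAIM2_CODING, CLAIM2_NONCODING, offset=4)
print(claim1_fully_paired, claim2_ends)
```

Under these assumptions the Claim 2 duplex carries four-base 5' extensions at both ends. These correspond to the cohesive ends left by HindIII and BamHI, the two sites labelled at the ends of the gene in Fig. 2.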



Patent claims

1. A synthetic gene, characterised in that it codes for the expression of urogastrone or an
equivalent thereof or a sub-unit thereof in a bacterial cell and comprises the following
sequence or a sub-unit thereof:

Coding:     5' AATTCCGATAGCGAGTGTCCTCT-
Non-coding: 3' TTAAGGCTATCGCTCACAGGAGA-
               GAGTCACGATGGTTACTGTCTACATGACGG-
               CTCAGTGCTACCAATGACAGATGTACTGCC-
               CGTCTGTATGTATATTGAGGCTCTAGACAA-
               GCAGACATACATATAACTCCGAGATCTGTT-
               GTACGCGTGTAATTGCGTTGTTGGCTACAT-
               CATGCGCACATTAACGCAACAACCGATGTA-
               CGGTGAGCGCTGTCAGTATCGAGATCTGAA-
               GCCACTCGCGACAGTCATAGCTCTAGACTT-
               ATGGTGGGAACTTAGA
               TACCACCCTTGAATCT

2. A synthetic gene as claimed in Claim 1, characterised in that it comprises the following sequence or
a sub-unit thereof:

Coding:     5' AGCTTAAAAAGAATTCCGATAG-
Non-coding: 3'     ATTTTTCTTAAGGCTATC-
               CGAGTGTCCTCTGAGTCACGATGGTTACT-
               GCTCACAGGAGACTCAGTGCTACCAATGA-
               GTCTACATGACGGCGTCTGTATGTATATT-
               CAGATGTACTGCCGCAGACATACATATAA-
               GAGGCTCTAGACAAGTACGCGTGTAATTG-
               CTCCGAGATCTGTTCATGCGCACATTAAC-
               CGTTGTTGGCTACATCGGTGAGCGCTGTC-
               GCAACAACCGATGTAGCCACTCGCGACAG-
               AGTATCGAGATCTGAAATGGTGGGAACTT-
               TCATAGCTCTAGACTTTACCACCCTTGAA-
               AGATAAG
               TCTATTCCTAG

3. A process for the production of a synthetic gene or a sub-unit thereof as claimed in
Claim 1 or Claim 2, characterised in that it comprises the assembly and linking of a number of
oligonucleotide blocks.

4. A recombinant plasmid, characterised in that it comprises a plasmid vector having a
synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2 inserted at a suitable
insertion site, the recombinant plasmid enabling translation in the correct phase for the mRNA
corresponding to the inserted gene or inserted sub-unit thereof and having a bacterial promoter
upstream of and adjacent to the insertion site such that the inserted gene or the sub-unit thereof
is under the control of the bacterial promoter.

5. A recombinant plasmid as claimed in Claim 4, characterised in that it contains pLF1 as the
plasmid vector.

6. A process for the production of a recombinant plasmid as claimed in Claim 4, characterised
in that it comprises inserting a synthetic gene or a sub-unit thereof as claimed in Claim 1
or Claim 2 at a suitable insertion site of a suitable plasmid vector.

7. A bacterial cell, characterised in that it contains inserted therein a recombinant plasmid as
claimed in Claim 4 or Claim 5.

8. A process for the production of a bacterial cell as claimed in Claim 7, characterised in that it
comprises inserting a recombinant plasmid as claimed in Claim 4 or Claim 5 into a bacterial cell.

9. A process for the production of urogastrone or an equivalent thereof or a sub-unit
thereof, characterised in that a bacterial cell as claimed in Claim 7 is cultured and the
expressed protein is isolated.

10. A fused polypeptide, characterised in that it comprises urogastrone or an equivalent
thereof or a sub-unit thereof covalently bonded with all or part of the gene of the tryptophan
operon and in that it is obtained by culturing a bacterial cell as claimed in Claim 7 and isolating
the expressed protein.

11. A fused polypeptide as claimed in Claim 10, characterised in that the gene of the
tryptophan operon is the trp E gene.
Claims

1. A synthetic gene characterised in that it codes for the expression of urogastrone or an equivalent
thereof or a sub-unit thereof in a bacterial cell and comprises the following sequence or a
sub-unit thereof:

Coding:     5' AATTCCGATAGCGAGTGTCCTCT-
Non-coding: 3' TTAAGGCTATCGCTCACAGGAGA-
               GAGTCACGATGGTTACTGTCTACATGACGG-
               CTCAGTGCTACCAATGACAGATGTACTGCC-
               CGTCTGTATGTATATTGAGGCTCTAGACAA-
               GCAGACATACATATAACTCCGAGATCTGTT-
               GTACGCGTGTAATTGCGTTGTTGGCTACAT-
               CATGCGCACATTAACGCAACAACCGATGTA-
               CGGTGAGCGCTGTCAGTATCGAGATCTGAA-
               GCCACTCGCGACAGTCATAGCTCTAGACTT-
               ATGGTGGGAACTTAGA
               TACCACCCTTGAATCT



2. A synthetic gene as claimed in Claim 1, characterised in that it comprises the following sequence
or a sub-unit thereof:

Coding:     5' AGCTTAAAAAGAATTCCGATAG-
Non-coding: 3'     ATTTTTCTTAAGGCTATC-
               CGAGTGTCCTCTGAGTCACGATGGTTACT-
               GCTCACAGGAGACTCAGTGCTACCAATGA-
               GTCTACATGACGGCGTCTGTATGTATATT-
               CAGATGTACTGCCGCAGACATACATATAA-
               GAGGCTCTAGACAAGTACGCGTGTAATTG-
               CTCCGAGATCTGTTCATGCGCACATTAAC-
               CGTTGTTGGCTACATCGGTGAGCGCTGTC-
               GCAACAACCGATGTAGCCACTCGCGACAG-
               AGTATCGAGATCTGAAATGGTGGGAACTT-
               TCATAGCTCTAGACTTTACCACCCTTGAA-
               AGATAAG
               TCTATTCCTAG



3. A process for the production of a synthetic gene or a sub-unit thereof as claimed in
Claim 1 or Claim 2, characterised in that it comprises the assembly and ligation of a
number of oligonucleotide fragments.

4. A recombinant plasmid characterised in that it comprises a plasmid vector into which a
synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2 is inserted at an
appropriate insertion site, the recombinant plasmid enabling translation in the correct phase
for the mRNA corresponding to the inserted gene or sub-unit thereof and having a
bacterial promoter upstream of and adjacent to the insertion site such that the inserted gene or
sub-unit thereof is under the control of the bacterial promoter.

5. A recombinant plasmid as claimed in Claim 4, characterised in that it comprises pLF1 as the
plasmid vector.

6. A process for the production of a recombinant plasmid as claimed in Claim 4, characterised in that it
comprises inserting a synthetic gene or a sub-unit thereof as claimed in Claim 1 or
Claim 2 into an appropriate insertion site of an appropriate plasmid vector.

7. A bacterial cell characterised in that it has inserted therein a recombinant plasmid as claimed in
Claim 4 or Claim 5.

8. A process for the production of a bacterial cell as claimed in Claim 7, characterised in that it
comprises inserting a recombinant plasmid as claimed in Claim 4 or Claim 5 into a
bacterial cell.

9. A process for the production of urogastrone or an equivalent thereof or a sub-unit
thereof, characterised in that it comprises culturing a bacterial cell as claimed in
Claim 7 and recovering the expressed protein.

10. A fused polypeptide characterised in that it comprises urogastrone or an equivalent
thereof or a sub-unit thereof bonded to all or part of a gene of the tryptophan operon and in
that it is obtained by culturing a bacterial cell as claimed in Claim 7 and recovering the
expressed protein.

11. A fused polypeptide as claimed in Claim 10, characterised in that the gene of the tryptophan
operon is the trp E gene.
                -10                 -5
Met Gln Thr Gln Lys Pro Thr Pro Ser Ser Lys
ATG CAA ACA CAA AAA CCG ACT CCA AGC TCC AAG   Coding
                                        TTC   Non-coding

            1               5
Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu
CTT AAA AAG AAT TCC GAT AGC GAG TGT CCT CTG
GAA TTT TTC TTA AGG CTA TCG CTC ACA GGA GAC

    10                  15
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val
AGT CAC GAT GGT TAC TGT CTA CAT GAC GGC GTC
TCA GTG CTA CCA ATG ACA GAT GTA CTG CCG CAG

20                  25                  30
Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala
TGT ATG TAT ATT GAG GCT CTA GAC AAG TAC GCG
ACA TAC ATA TAA CTC CGA GAT CTG TTC ATG CGC

Fig. 1 (Part 1 of 2)
                35                  40
Cys Asn Cys Val Val Gly Tyr Ile Gly Glu Arg
TGT AAT TGC GTT GTT GGC TAC ATC GGT GAG CGC
ACA TTA ACG CAA CAA CCG ATG TAG CCA CTC GCG

            45                  50
Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu Leu
TGT CAG TAT CGA GAT CTG AAA TGG TGG GAA CTT
ACA GTC ATA GCT CTA GAC TTT ACC ACC CTT GAA

        55
Arg Ter Gly Ser
AGA TAA GGA TCC   Coding
TCT ATT CCT AGG   Non-coding

Fig. 1 (Part 2 of 2)
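
The correspondence in Fig. 1 between the coding strand and the residues printed above it can be checked by translating the strand with the standard genetic code. The sketch below is illustrative only: it assumes the coding strand and residue rows of Fig. 1 as transcribed above, uses one-letter residue codes with `*` standing for Ter, and introduces the hypothetical names `translate`, `FIG1_CODING` and `FIG1_RESIDUES`.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Coding strand of Fig. 1, one string per row of the figure.
FIG1_CODING = "".join([
    "ATGCAAACACAAAAACCGACTCCAAGCTCCAAG",
    "CTTAAAAAGAATTCCGATAGCGAGTGTCCTCTG",
    "AGTCACGATGGTTACTGTCTACATGACGGCGTC",
    "TGTATGTATATTGAGGCTCTAGACAAGTACGCG",
    "TGTAATTGCGTTGTTGGCTACATCGGTGAGCGC",
    "TGTCAGTATCGAGATCTGAAATGGTGGGAACTT",
    "AGATAAGGATCC",
])

# One-letter form of the residues printed above each row ('*' = Ter).
FIG1_RESIDUES = "".join([
    "MQTQKPTPSSK",
    "LKKNSDSECPL",
    "SHDGYCLHDGV",
    "CMYIEALDKYA",
    "CNCVVGYIGER",
    "CQYRDLKWWEL",
    "R*GS",
])


def translate(dna):
    """Translate a DNA coding strand, codon by codon, from its first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


translated = translate(FIG1_CODING)
leader_length = FIG1_RESIDUES.index("N")          # residues before Asn 1
mature = translated[leader_length:translated.index("*")]
print(translated == FIG1_RESIDUES, len(mature))
```

Under these assumptions the stretch from Asn 1 to the Ter codon is 53 residues long, the length stated for urogastrone in the description.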
Restriction sites marked: HindIII, AluI
                -10                 -5
Met Gln Thr Gln Lys Pro Thr Pro Ser Ser Lys
ATG CAA ACA CAA AAA CCG ACT CCA AGC TCC AAG
                                        TTC

Restriction sites marked: EcoRI, MnlI
            1               5
Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu
CTT AAA AAG AAT TCC GAT AGC GAG TGT CCT CTG
GAA TTT TTC TTA AGG CTA TCG CTC ACA GGA GAC

Restriction sites marked: HinfI, AccI, HgaI
    10                  15
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val
AGT CAC GAT GGT TAC TGT CTA CAT GAC GGC GTC
TCA GTG CTA CCA ATG ACA GAT GTA CTG CCG CAG

Restriction sites marked: MnlI, XbaI, FnuDII, HhaI
20                  25                  30
Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala
TGT ATG TAT ATT GAG GCT CTA GAC AAG TAC GCG
ACA TAC ATA TAA CTC CGA GAT CTG TTC ATG CGC

Fig. 2 (Part 1 of 2)






Restriction sites marked: HphI, HaeII
                35                  40
Cys Asn Cys Val Val Gly Tyr Ile Gly Glu Arg
TGT AAT TGC GTT GTT GGC TAC ATC GGT GAG CGC
ACA TTA ACG CAA CAA CCG ATG TAG CCA CTC GCG

Restriction sites marked: MboI, TaqI, BglII, HphI
            45                  50
Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu Leu
TGT CAG TAT CGA GAT CTG AAA TGG TGG GAA CTT
ACA GTC ATA GCT CTA GAC TTT ACC ACC CTT GAA

Restriction sites marked: BamHI, MboI
Arg Ter Gly Ser
AGA TAA GGA TCC
TCT ATT CCT AGG

Fig. 2 (Part 2 of 2)
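
Some of the sites named in Fig. 2 can be located directly in the coding strand by searching for their recognition sequences. The sketch below is illustrative only. The recognition sequences in `SITES` are the commonly published ones for these five enzymes and are an assumption here, since the figure names the enzymes without spelling the sequences out; `find_sites` is a hypothetical helper, and the coding strand is taken as transcribed in Fig. 1.

```python
# Coding strand as shown in Fig. 1 and Fig. 2, one string per figure row.
CODING = "".join([
    "ATGCAAACACAAAAACCGACTCCAAGCTCCAAG",
    "CTTAAAAAGAATTCCGATAGCGAGTGTCCTCTG",
    "AGTCACGATGGTTACTGTCTACATGACGGCGTC",
    "TGTATGTATATTGAGGCTCTAGACAAGTACGCG",
    "TGTAATTGCGTTGTTGGCTACATCGGTGAGCGC",
    "TGTCAGTATCGAGATCTGAAATGGTGGGAACTT",
    "AGATAAGGATCC",
])

# Assumed recognition sequences (palindromic six-base sites); not taken from
# the figure itself.
SITES = {
    "HindIII": "AAGCTT",
    "EcoRI": "GAATTC",
    "XbaI": "TCTAGA",
    "BglII": "AGATCT",
    "BamHI": "GGATCC",
}


def find_sites(sequence, sites):
    """Map each enzyme name to the 1-based start positions of its site."""
    hits = {}
    for name, motif in sites.items():
        positions = []
        start = sequence.find(motif)
        while start != -1:
            positions.append(start + 1)
            start = sequence.find(motif, start + 1)
        hits[name] = positions
    return hits


site_positions = find_sites(CODING, SITES)
for enzyme, positions in site_positions.items():
    print(enzyme, positions)
```

With these assumed recognition sequences each of the five sites occurs exactly once in the strand, in the order HindIII, EcoRI, XbaI, BglII, BamHI, which agrees with the order in which the enzymes are named across the rows of Fig. 2.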
                                         AG   Coding
                                         --   Non-coding

Coding-strand blocks: I (13), III (18)
            1               5
Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu
CTT AAA AAG AAT TCC GAT AGC GAG TGT CCT CTG
--A TTT TTC TTA AGG CTA TCG CTC ACA GGA GAC
Non-coding-strand blocks: II (15), IV (18)

Coding-strand blocks: V (18), VII (18)
    10                  15
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val
AGT CAC GAT GGT TAC TGT CTA CAT GAC GGC GTC
TCA GTG CTA CCA ATG ACA GAT GTA CTG CCG CAG
Non-coding-strand blocks: VI (18), VIII (18)

Coding-strand blocks: IX (12), XI (12), XIII (18)
20                  25                  30
Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala
TGT ATG TAT ATT GAG GCT CTA GAC AAG TAC GCG
ACA TAC ATA TAA CTC CGA GAT CTG TTC ATG CGC
Non-coding-strand blocks: X (18), XII (20)

Fig. 3 (Part 1 of 2)
Coding-strand blocks: XV (16), XVII (13)
                35                  40
Cys Asn Cys Val Val Gly Tyr Ile Gly Glu Arg
TGT AAT TGC GTT GTT GGC TAC ATC GGT GAG CGC
ACA TTA ACG CAA CAA CCG ATG TAG CCA CTC GCG
Non-coding-strand blocks: XII (20), XIV (16), XVI (12)

Coding-strand blocks: XIX (15), XXI (12), XXIII (14)
            45                  50
Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu Leu
TGT CAG TAT CGA GAT CTG AAA TGG TGG GAA CTT
ACA GTC ATA GCT CTA GAC TTT ACC ACC CTT GAA
Non-coding-strand blocks: XVIII (14), XX (14)

Coding-strand blocks: XXIII (14)
Arg Ter
AGA TAA G         Coding
TCT ATT CCT AG    Non-coding
Non-coding-strand blocks: XXII (12)

Fig. 3 (Part 2 of 2)
                                         AG   Coding
                                         --   Non-coding

Non-coding strand, residues -3 to 8:
--A TTT TTC TTA AGG CTA TCG CTC ACA GGA GAC

Non-coding strand, residues 20 to 30:
ACA TAC ATA TAG CTT CGA GAT CTG TTC ATG CGC

Fig. 4 (Part 1 of 2)






                35                  40
Cys Asn Cys Val Val Gly Tyr Ile Gly Glu Arg